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Descrfptton 

Techn'icar Field 



The present 'mertfjon relates to an amino acid stequence of membrane protein occiufm in a tigfrt junction ftierdn- 
«fter refened to as "the TJ^ of a human, a dog and a mouse, and a DNA for encoding the amino acid sequence. 



Background Art: 



to In mufficelMar animals, the infomialion on cellular adhesion between ac^acent cells is deeply oancerned with the 
regulation and maintenance of vital phenomena such as cellular proliferation and differentialion, inf la Ttnation and can- 
cer metastasis., Jnt^ljular adhesion mdecUes which tate part in the adhesion of the cells frequently assemble 
together on the surfaces off these (^Is to fomn a specifically differentiated membrane region for the mlhesion Particu- 
larly, interceliular adhesion molecuJes such as cadherins are known to be firmly bound to cytoskeletori in the cytoplas- 

15 mic domain of epithelial cells. Such membrane regions are called InterceHular adhesion apparatus ami chiefly classified 
into tiie following four structures: gap juiction (QJ), adherens lunation (AJ). desmosome and tight juixlion (TJ). 

These adhesfen apparatus has first been identified uncler an electron microscope, and as a result of the investiga- 
tlon and research of constituent proteins, the importance of their physbtogical and pathological ijignifflcance has 
become the fpcus of increased interest The proteins thit ai-e called the so-called adhesion molecuiej specifically exist 

20 in these adhesion apparatus, and the adhesion molecules* of the AJ are cadherins and various kinds at cadherins such 
as fsK»dherin and P-cadherin have been identified so far [Takeichi.M eta]., "Science", Vol 251. 1451-1455 (I99i)l 
As adhesion molecules of the desmosome. desmoglein and d^smocolDn are known, and according tc> recent ^es. it 
has been elucWatoJ that their structures are ^lar to those of ttie cadherins (Buxton, a S. et al.. "J Cell Btof " Vol. 
121, p. 481-484. (1993)i: The adhesion molecdiBs of the are called connexin. and it is known thiil connein holds 

25 transmembrane domains at four different sites and botii of its. N-termirial and Otenninal protrude on the cytootesmic 
side of the membrane. 

The TJ is an intercellular adheston apparatus pecuRar to epithefial cells and endothelial ceils, w^a e the cell mem- 
branes of contiguous cells are seen completely tightly apposed. The TJ surrounds indivklual cells a t»J functions as a 
bamer to btock or regulate penneatlon of water-spluttf^ molecules between the luminal and basement nenferane *es 

30 of a ceU layer. K has also been described to act a§h a fence partitioning the cell membrane into apice J and basdaleral 
sides m order to majntafn the polar disCriUiiiun ctf such memDrane proietns as lon Channels and pumps as well as lipids 
on the eel ntemtyane [Schneeberger. E. a et al:. "Am. J. Ptiy^loL"; VoL 262, P. L647-L661 , (1992)' . Owing to these 
functions of the TJ. mifiae consisting of different f Md convx>sitions are fomied on the opposite sides" a cell layers so 
that the polarity of the cell layer is maintained: hence the TJ can be said to be a fundamental stnjclure of vital Impor- 

35 lance to multicellular organisms. : , . -7 

Howa/er. analysis of the molecular structure of the TJ has been less progressing, compared t>. other adhesfon 
apparatus. In feet it tes constituted a serious dirawback to tiie pursuit of molecular biotogieal researdi on the TJ tfet 
the TJ adhesion molecule itself has not been Wenfifiad yet . 
The present inventors have established a method for isolation of AJ from rat liver, and have Wentiffi^ad many proteins ^ 

40 such as radixln and 201 from this isolated AJ ftsukita. Sh. et al.. "Cum Qpia GelJ Btol.-, VbL 4. R il5i4-839. 11992)] 
From researches on ZOl and histologic findings for the AJ and the TJ. it can be presumed thatthe ff aleinsin the fltj" 
also contain protein of the TJ. In view of this, the present inventors have isolated AJ ftom chfck liver, pnapared a mpnor; 
clonaf antibody agafnst the AJ as the antigen, and carried out structural analysis of theTJ-constitujing prol^ using ijie 
antibody q)ecificany reacting with the TJ. As a result the present inventor has been successful m ttie sr jdural analysis 

45 of a novel constituent protein dissimilar to known proteins, and designated the protein as occfudin fFuruse M et at 
"1 CeU Biol.". VW. 123. PL 1777-1788, (1993)]. 

This chkk ocdudin is a 56KDa protein conposed of 504 amino acbs. characterized conspicuouiJh/ by transmem- 
brane domains at four sites in tine half of its N-terminus. with both tiie N- and C-tenninals faang the cyloplasm and with 
two extraceiuar loops. 

60 From siijsequent studies, occludin was inferred to be an important factor in the analysis of physiofogtoal function 
of the TJ at the cellular level as well as at the whole body level, and drew much attentnn of investigators. 

^to further study has progressed, nevertheless, since the sajd protein has its origin in the chicken speo'es greatfy 
remote from humans. Thus, structural analysis of ocdudin of human origin has been expected for the sake of elucida- 
tion of the physiological function arKi medical armlysis of the TJ. There fe as yet no report of success ).i the elucidation 

55 of human ocdudin despite woridwide competition in research fbr this purpose in this field. 
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SUMMARY OF THE INVENTION 

II is accordfingty an object oi the present Invention to provide amino acid sequences of huma i. canine and mouse 
occtudms and the DNAs encoding them. 

s In their report on the gene for human neuroiwl apoptosis InhOjctory protein (NAiP), Roy et al. documented occur- 
rence of a DNA fragment possessing a' base sequence analogous to the C^emunal region of dtmi< occludin in NAIP 
gene defetion mutants [R(y, N. et al., *Ceir. \W. 80. p. 167-178, (1995)]. To ascertain whether the said sequence actu- 
ally encoded, a part of human analogue of occfudin or not, the present inventor select.ed prinicira out of the base 
sequence analogous to that of chicKocdudin and made a scrupulous screening with acDNA Ifbrary of hunron inlestkial 

10 ephhefial cell strain TB4 as a tenplats lor PGR. The present inventor has thus succeoded in the luiafysis of the whole 
structure of huran ocdudSa Further, the inventor, has completed analyses of mouse and canine <>cdudins, prepared 
anti-ocdudin monoclonal antibodies, and verified with histologic staining that the occludins were transmennbrane type 
TJ proteins. 

15 Best Mode for Carrying out the Invention; 

The present Invention is concerned with anwio ac^ sequences of a human, ai canine and.a nunise occludin, DNAs 

for ancndino ih^m. anfa'-ocdudtn.Aoiibodlciv, an?l a gonotio anolyolo method utilicing them. Mure ^^jistiffcany, Itie present 
invention is directed to ttie following aspect 

20 

(1) A DNA for erKQding a hun^ CNxludIrt having an annino acid sequence descn'bed in Sequ aiice Na 1 . 

(2) A pNA,for,(^CQding a *uiman ooclu^ 

(3) A DNA described In Sequence Na 5 for encoding a canine occkidin having an annino add sequence describee^ 
in Sequence Na 2. 

25 (4) A DNA described in Sequence Na 6 for encoding a mouse ocfdudin having an amino acid & jquence descritied 
in Sequence Na 3. 

A human, a canine and a nrKJuse ocducSn having the amino add sequences desaibed in Sequence Noa 1. 2 
and 3, respectively. 

(6) An occludin variant having an amino add sequence in which ^ne or plural .amino adds ir^ the amino add 
30 sequence of eau^occkalin are added, deleted or substituted, and a DNA ^ 

(7) A vector which comprises any of DNAs for encoding a human, a canine or a mouse ocdiKiin or tor encoding 
their valiants.:, 

(a) A Jrarreformanl wWch holds the vector. . . r 

. (9) A melhodfor.manufacturing an ocdudin piptein which conprisesi the^teps of culfivating the transfarmant, and 
35 collecfing an eixpressed product . ^ . 

(1 0} A pNA prpbe^cpmprismg cpntatning the whofe or part of the t>ase sequence as defined iri j^uence Na 4, 5 , 
o ^,. --v i* ' ■ ■■■"■--:;..■■-.-..-■";■*' , . .''^^ - -J.^:-:,' • 

(1 1) A DNA primer cpmpn^ng containing part of the t»se sequence as defined In Sequence IMx 4, 5 or 6.. 

(1 2) A polyclonal antibody or a monodonat antibody spedflcally bindoig to a humaa a canine or a mouse occludin 
40 protein..,- - • , , ^ 

(13) An assay method and an assay reagont for oprdudn in a biological specfm^n. wherein ai anti-ocdudin anG- 
bodyisused. 

(14) , An analysis method of.an ocdudin gene In a btologlqal sp^dmen, wherein said DNAertme-i3r said DNA probe 
isusecL :f< ,■ - ■ ,■■ 

45 (15) A screening method of a drug affecting the e)qxesGiori of opdudip, wherein occludlrh^xpressing cells and an 
analyte are allowed to coexist, and an expression quantity of an opcludin gene of ^aid cells js tfien determined by 
theuseof apNAprirneroraONAprcba \. 

(16) An antisense DNAderived from a human ocdudin DNA. 

(1 7) A laboratory animal whose ocducfin DNA is knocked out 



so 



Detailed De scrfafion of the Invention: 

With the success of the present inventor rn identifying ocdudin analogues of nnammals, it is xxoh possible to struc- 
turaly and functfonally test the con^Uion and function of the TJ at the molecular level. The banicf and f^dng func- 
tions of the TJ and the related regulatory mechanisms can be analyzed through experiments in«)lving corrlrol of 
expression of the gene for ocdudin or inhilxtfon of the occludbi fuTKtion with either an antisense pri^l je or an antibody, 
using various types of cultured human, mouse and canine (MDCK) cella For oxanv>la it is now po sisible to deta-mne 
whether or not overexpresskxi of ocdudin cDNA gives rise to an increase in numbei; of TJ strands (uren in freeze-frac- 
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Hired r^icas and inddenlaHy to an augmentation of the barrier funcikja Purtherrrore. the present in^«ntion has made 
It possible to establish a simple saeening method for drugs affecting TJ function. For example, drugii affecting TJ func- 
tion can be screened using various types of c^fs expressing occfudin. t>y allowing the cell to react wilh a test artide and 
subsequently b)f measuring the amount of cellular ocdudin gene or ocdudin protein expresaoa The g ene analysis can 
5 becamedoutbyusingaDNApfobeorprbTierorotherdevk^^ theNorth- 
ern blot technique: or Southern blot t^rfirVique wherein RNA or.DNA extracted from a test sanple In the usual manner 
IS pretreated when necessary, then electrophoresed on a membrane or gel, and hybridized with a labeled DfMA probe 
and the polymerase chain reaction CPCR) technique wherein the o^ective DMA Is an^iied using p imers of about 20 
bases corresporeiing to the relevant sfte and with a genomic DNA or cDNA as the templata The occtudin protein can 
10 be quantrtatedt for example^ by the use of an antibody, : 

Moreov^, it is also made pdssible to ascertain how the TJ fbrnnafion is involved in the morphojisnesis of various 
organs and whether functional failure of the TJ has any relation to varloii^ pathologic states such as iTflammation and 
tumor metastasis, by preparing various types of mutant mice and ocdudin gene-knocked out mica rtie possibinty of 
controlling a TJ function, espedalty hs banier function, is also of interest in connection with dnig perrneatenty. Thus, it 
15 would be feasible to control the Wood^rain barrier via ui> or down-regulation of ocdudin synthesis in epithelial cells of 
the braia Control of the TJ function in the enteric epitheHal cells is necessary to regulate drug absorplr< tfi from the irrtes- 
tina ft wiir thus become possible to control drug absorption, particularly distribution to the brain tissue by adirfinistering' 
an effective mibs^ gdreened out of dKjgs affecting the TJ fuhctibh: Hertcte, the present invention is highly antich ^ 
pated for elucidation of the physiological mechanisms primarily of the bkxxHjrain torier. as well as few analysis «aa- 
so nosis and treatment of disease slates. 

The DNA provided by the present invention can be utilized in the analysiS'of genes for ocdudin prcrteins and of gene 
expression thereof by using apartofitasa primer or a probe. TTie terma part here denotes that the ofigonuddotide to 
be used as a primer of probe cort^rises containing at tedst a 10-relevem^base sequence; or prefenibly at le^st 
base sequence, or more preferably a corresponding potynudeotide comprising containing approxiTately 20- to 30- 
base seqi^ce based^on the DNA ^uence of the present inverifida As the probe, a higher macromolecular or eweh 
the whole DMA may be used. 



2S 



3S 



■mere IS a method utifeing anfisense DNA or antisense RNA as a means to control tfie function occludin The 
method IS intended to blod^ the flow of gene expression by interfering with the reading of genetic Infc^r natibh at any of 
the stages of gene expression such as DNA repflcation. transcriplfon and translation, and the anti?^se tedviique 
emptoys nudeic acid or its analoflue for the blod«ge (Widelrome. E. ed. Prospects for Antisense Mi deiti Add Ther- 
apy of Cancer and AIDa Wiley^U'ss. New Xbrk, 1991). The etudd&tioh off the ocdudin DNA accordiria to the present 
invention has made possWe the means for inhbiling ocdudui functions by the antisense metiiod. Tli.s length W ONA^ 
o^omer has bearing on the double slrand-fornting capacity, membrane penneability and base seq ji^nce spedficili/; 
and at least 6 nudleotides, c/r preferably at least 10 mers; i^ly 15 tb ^ mere, may be used. Approaiatd s&iiericei 
may be selected on the basis of Ihe DNA sequence of the present Invention, and verified by experirtiBTt^on. Usi^ir 
''^S^To^^^!!!?^^ modified at itfe phosphate group, suga#'mdety. arid 3* and 5^ tails in order tc. tiugment te ^ 
bih^ iOouK p.a. Anticancer Orig. Des., 5.585. 1991). Representative analogues are digophosphorotiioSe where one 
of toe oqrgen atoms of ttie Intemud^de phdsphodiesler groiiplSTeplaced t>y a suifiir atorrt, and bllgomtrthyteulfbnate 
whOT the Gdid oxyigen Is replace a rttfethyi group; all such analogues are remarkably sl^e to nu(*3ases: Besides 
such oSgomers as tiiose with acridine or polytysine bonded to tiiem and those containing N-melfihrthymldylate to 
increase slabHity Of the hytjrid doiflsiB stf&nd. are also used. These ofigomers can be synttieslzed by Irown chen^ 
fiyrtheticprocedures.AntisenseRNAderivedfromtheDNAofthepresenlinventionmayalsob^ . / 

The ocdudin inotein of the presefrf invention n^ bd utili2!ed tor pr^ation bf an antibddy uar g the whole or a;- 
partof It as an epitope, and tor use the antbody thus prepared as research and diagnostic reagents. Ilie term epitbi^ 
denotes an antigen detennlnant of pofypejrtide: Epitope i6 usually cbmprisfed xrf at least 6 ammo addii. dnd ft is known, 
that a polypeptide cbiteisting of 6 amnb ackte cbmWn^ with an antibddy (JP.Are0-500684). The aniijjenic pebttde bf 
me subiect protean sigrafres a polypeptide conprising a s^es of at least 6 amino acAte; preferably a sari^ of at least 
^'l?^' nrwe preferably a series of at least 15 aminoadds; of further preferably a series of a least 20 amino 
acttJs. based on the amno add sequence of the present Invention. Occliidin pros/ided by the present im enfiori is a pro- 
tein whidi. as inferred from its antino add sequ^ in analogy wrtti chid< ocdudin, possesses transmembrane 
domans at four sites in a half of its ^^terminal re^on, with the N- and Otemiinus fedng the cytoplasm; and whldi has 
hro extracellular loops. In the case of human occliKfin of which anvno add 89-135 and 196-243 regioi«: are presumed 
to be adracelluterty apposed, various antibodies may be prepared by selecting antigenic sites apprapiiaie for purposes 
and utnzed as a means to eluddate the TJ function and as a means to si^ipression by the antibody oMhe TJ func^ 
«rs a^so possible to utinze the partial peptides as a means to screen oorrpounds tor those capable of fc inding tothese 
pepttdes. 

Proteins having an amino dcid sequence of occfucfin of the present invention to which one or a piurafity of antirio 
actdsareadd6dorofwhidi one or a plurality of amino adds are deleted or substituted are also enccnpassed by 
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present irwention. 

(1) Preparation of cDIMA library and structural analysis of ocdudin 

6 Preparation of RN A may be carrierfout using human or animal cells (cell strains) as the raw mrterial, by Ibr exam- 
ple extraction with a nroced solution ol guanidine thiocyanate, a surfactant, a chelating agent and a reductant followed 
by phenol eDctraction. fractionation in organic solvents (ChamezynsW et al,. AnaL Biochem.. 162. 1 56, 1987) and sub- 
sequ^itly density, gradient ultracentrifugalion procedura Using the RNA thus obtained as a tenv3l£ite^ a double strand 
DMA is.prepared in theiisual manner such as by the cDNA synthesis technique (QuWer. U. etaL, Gene, 25, 263.1983) 

w wHh the use of random primers, reverse transcriptase, DNA polymerase, etc. A DNA library can kc prepared by inser- 
tion of the doiiile strand DNA obtained into a bacteriophage such as X zap or X gll l in the usual mann^-. Commerdal 
cDNA libraries may also be used. 

Acwfing to the report of Rcy et a);, ft is pos^ble to obtain a Df^ fragment presumed to bs^ ctccludin DMA origin 
by property selecting a primer region based on the base sequence analogous to the C-terminus of chick ocdudin- by 

15 ampltfication of the DNA wHh the PGR technique, and by tts subdonation. Subsequent screehlnii oMhe cDNA Irbrary 

with this DNA fraoment as a probe and analydft of th» baco coquonoo off tho done loolatod may. yield a wfvHeHtmi^ 
cDNA ferooctudln.The structure oithe base sequence is delemnined 1^ the Maxamr<SIba1 metha j (Maxam. A. M. and 
Gilbert W. Proc. Nallr Acad: Sd. USA. 74, 560. 1,977) or by lhedk*eoxynucleotidey>ain terminatcn method (Sanger, 
P., Proc. Natl. Acad Sd. USA, 74, 5463. 1977). The amino add sequence is thus deducedion tin* basis of the base 
20 sequence. These gene nranipulations can be perfc>rmed by the known usual methods, for e)ffinpli) in accoidance with 
thosedescribed In Molecular Cloning. A Laboratory Manuals* T. Maniafe elal. eds. (1989>,>CoW Sftrng Harbor labora- 
tory. . . 

(2) Preparation of antibodies 

To prepare the monoclonal antitxxJy according to the present invention, humani canine or moiJi» ocdudin Is used 
as the antigen, its complex wwth a carrier protein is prepared iff deemed necessary, and approprialo an^ls are Immu- 
nized by inoculation with the antigen. Antody-forming cells obtained from the spleen or lymphnodes of the above 
Immunized animate are fused with myeloma ceils, wherdtjy hybridomas produdng an anfflxxiy itrongly specific to 
30 ocdudin are selected to prepare the monodonalantifcKxJy The preparation procedure may be in iMcordance with the 
Known prior method 

As the inruminogen. any of such products as purified natural products and products prepared genetic recombi- 
nation technique or chemical synthesis may be used For preparation off ocdudin by the recombinant DNA technique, 
the cDNA encoding ocdudin can be religated to the promoter downstream off a vector appiopriato for expression of 

35 ocdudin by the. known method^ using restrlctioh enzymes and DNA ligase to obtain a recombln^t expression vector.^ 
Tho said vector is nonlmitattve insofar as It can boT^rcated and amplrHed in afiosL With regaid Ici the promoter and 
terminator, there is also no particular nmrtation as long as Ihey are accordant with tiie host used far e)9)ressIon of the 
base sequence encoding ocdudin; and appropriate combinations suited to the host fnay also Iji^ practiCablS. Thet 
recombinant expresaon vector ttius obtained is introduced into the host by the competent cell tecfnique (J. Mol. BioL. 

40 53, 154, 1970) orthe calcium phosphate procedure (Sdence, 221, 551. 1983) to prepare a transfannanL Such organ- 
isms as Ese^erichta coH and animals are used bb the host, and the transtarmant obttuned is ailtunicl inan appmpriate 
medun su^ed to the host. The culture incubation is carried out usually at a tempefatur&t>etween 21 rC arid 45?C and 
at pH between 5 awd 8,'with aeratJon^arKlAor stirring where required Isolstion and purification of ocdudin from the ci^ 
tured microorganisms or cells may be pertbmied by an appiopriate combinaeon of known metiiod^xrf isolation andpuri- 

45 ficafiai. These known methods include salting out organic solvent method dialysis, gel fiftratlon, isledrophoresis. ion. 
exchange chromatography, affini^chronratography. and reiforseiJhas^ . 

The immunogen. ocdudin, Is preferably to retain its whole stnrture but may be in the form of a fr agment or peptide 
having its partial structure; it iriay be appropriately selected from tiie whole amino acid sequence of c«cdudia For prei>- 
aration of the fragment or peptide, a method sudi as chemical synthesis, the above mentioned ger>e rsoomlMnalion pro- 

50 cedure or degradation of a naturally occurrir^ artide is employed 

Various condensing agents may be used for preparation of the immunogen-carrier protein con^jlex; such reagents 
as gkjtaraldehyde. cart>odmiide. and maleimide activated ester may be used. 

The carrier protein may be any of those commonly used such as bovine serum albuntin^ thyroijlofaulin and hemo- 
cyanin^ and usually the method wherein a ir to 5-fo!d quantity of a carrier protein Is coupled to antii jim is used 

55 Animals emptoyed for immunization include the mouse, lat rabWl, and guinea plgs..and inocufatic n is made by sub- 
cutaneous. Intrarrwscular or Intraperitoneal Injection. The administration of Immunogen nray be carri 9d out In the form 
of a rrnxture smth complete Freund acQuvant or with incomplete adjuvant, and is usuafly made once < j\rery 2 to 5 w^eks. 
Antajody-produdng cells obtained from the spleen or lymphnodes of the imnrwnized animals are fiis;ed wth myeloma 
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cefls anc* isolated as hybrydomas. The myeloma cefls used are those of mouse, rat or human origirt, preferably alloge- 
neic to the antigen-producing cells used but. in some instances, can be xenogeneia 

The manipulatjon of cell fusion can be conducted in accordance, for example, with the method <rf Milstein and KOh- 
ler (Nature, 256,495. 1975). Fusogens used include such agents as polyethylene glycol and Senctei virus, and the cell 

6 f uaon can tie made by incubatlnii anlibody^produd ng ceOs with myeloma cells in an approwrmte p opulation ratio of 1 :1 
to 10:1 at a temperature between 20 and 40^C. preferably between 30 and ST'^C, for about 1 to 10 minutes, using po^ 
yelhylene glycor(mcan molecular weight. 1 .000 to 4.000) usualy at a concentration of about 20 K) 50%. 

Various immunochemical methods can be used for screening hybridomas producing an anti-oix ludin antibody. The 
methods Include enzyme-nnked immunosorbent assay (EUSA) usng nticroplates coated with ocdudin, enzyme immu- 

10 noassay {EIA> using micoplatos coaled ^ith an anti-immunoglobuDn antibody, and Western blotting technique in which 
samples containing occludin are electrophoresed with the subsequent i^e of nltroceltulose trarsft^i membranes. 

Qones are obtained from these wells further by. for example^ Bmifing dilution. Screening and breeding of hybrido- 
mas ar6 performed in a culture nredium for anlrnal cells (o.g. HPW 1640) containing 1 0-20% fetal Ixiyineseaimiisualiy 
with added HAT (hypoxanttimeri ariiinopterin and thymidine). The done thus obtained is intraperilonaaHy transplanted 

IS into BALB/c rrvce previously dosed with bristan, arid ascites containing a high concentration of a ironodonal antibbdy 
is collected 10-14 days later so ttet the ascites can be used as a source for purif ication of the antitody. Furthermore, 
the doned hybridoma cells are cultured so that the cultured eels can be used as the source for puiificallon of the anfi- 
Ijody. Known methods lor; purificafion off. immunoglobulin may be used for feco^^erin^ the mohcclonal antibody; the 
recovery can be readily accomfidished for example by such means as amnrK>niunn sul^e fractionitticn, PEQ fractional 

20 tion, ethanoi fractiorratioh, utitizalson of: anion exchartgers. and affinity chromatography. - • 

With immunological melhdc^usingtheanti-ocdudin monoclonal antibody obtained in accordsir ce wth the present 
invention, it is possible to make qua&taGve and quantitative detennr^ration of ocdudin in bkrfogiciil spedmens. As the 
immurological methods, conventional methods such as Immunohistologic stabling, enzyme immjnoassay, agghitina- 
tiorr test, competitive assay, and sandwich technique may b6 applied to samples from tiiological !;|jedmens thiat has?B 

25 been appropriately processed, where required, ag. tscdation of cells and extraction. The immunohi stologic staining can 
be performed for example by the direct method i^ng a labeled antibody or the indirect method- usin(i a labeled antitKxiy 
directed to the antftxKty bound to target antigen: Any of such known labeling substances asfluon^icent agents, radio^ 
active substarices, enzyn>es, m^s and dyes may be used as lat>eling agent& 

The monoclonal antitxxSy of tine present invention may be used In the form of Fab* or Fab fragrn&nt after remcval of 

30 its Fc' or Fc region, or in the form of polymer of atlw fragment Furthermore, it may also t>e in tf-iC' form of a chimeric 
antibody, a humanized antibody, or a human antibody. 

EXAMPLE- ■ - - ' '■ 

35 The present invention will ncnv be illustrated In detail with specific embodiments by the Ibl' owing examples. Of < 
coursepthapresentinvenfion^hannotboliniHedtottiefollowingHxanfiples.^^ ;^ - .r. 

Example 1 Structural analysis of human occludtn ' ' 

40 Based on a base sequence of a part of human hJAlP-deTicient gene analogous tathe.C-tbrntirite of chick ocdilidin, 
PGR was performed 4isihg as primers the oligonucleotides of Sequence Nos:'7 and a Xgtl 1 cDNA Irorary was prepared 
by purifying poly(A)+RNAfrom a source consisting of hunmn intestinal epitheGal cell 6trajn>T84 anci Isy using TimeSaver 
cDNA Synthesis Kit (trade name; Phamriacia LKB Bfotechnotogy inc. )and QIQAPACK II Pactegirti;i Extract (SIratagend 
Inc.). The PCR was canned out wltfi tWs library as tiie template and with said two primers yielded a cDNA fragment of 

45 363 base pairs. ■ -. • 

This DNA fiagmenl.was DIQ4abelGd using DIG LabeTing Kit (trade narne; Boehringer Mannhtam), and said nbrary - 
was screened with it used as the proba As a result, three cDNA cfones were isolated, their insertion sites were cut and ' 
they were subdoned to pBluescript;SK(*). Of these, the two ctones phOc6 and phQc16 were prsiiunied to contain a 
total ORF, and tiiese two doned strands were analyzed fOr their base sequences, with the results demonstrating that 

60 said t>ase sequences encoded tiie whole structure of human occlucfin. The coding sequence wati determkied using a 
7-deaza Sequenase Version Deoxy Temrilnator Cyda Sequencing Kit (trade name; Applied Bicsystems). The base 
sequence is shown tn Sequence Na 4, and the annno acid sequence deduced therefrom in Sequence Na 1 . 

Structure of canine and mouse ocdudins were determined in the same manner as described above using Xgtl 1 
and xgtIO cONA Iforaries,. respectively; prepared from dog Mdney (MEX^IQ celte and mouse luiig cell& Tlie t>ase 

65 sequence and amino acid sequence of canine oodudln are shown in Sequence Nos. 5 and 2, and the base sequence 
arxf amino add sequence of mouse ocdudin in Sequence No& 6 and 3. Escherichia coiiJfA 109 containing the human 
occludin cDNA tws been deposited (Deposition Na PERM BP-5477) with the hiational Institute of Biosdence and 
Human Technolos^, Ministry of International Trad and Industry, Japan (addr^- 1-1-3, Higashi, I sukito, Ibaraki, 305 ' 
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Japan) as of March 15^ 1996. 

P^arofpg Preparation of an anti-human ocdudin monocfonal antibody 

The cDI^ fragmert^encodlng the cytoplasmic region on the Olenrtnal side ot human ocdixlin vyas obtained by 
cuto.^ tt^ pBlu«cnpt SK(-) vector containing Sequence Ito 4 described in Example 1 with rest>i(«on enzymesSspl 
^ P-Pduds of rakara Shuzo Ca, Ltd.). Th« fragment was introduced into „<3EX-3X vectorrand 
^^t^'- ^ ««« to P^Para a GST fused protein. Rats'.ere immunized with 

this fused protein as an antigea so that a monoclonal antibody was prepared. 

The KA immunteation. was performed by infecting the antigen in doses of 300 (ig/injedion into 1 h a hindGmb paw rirst 
ll^^?* ?°^^ ^ **** thereafter (days 3 arel 7 after the fJtst). On 

ttietey following the last injecton. inguinal lymphnodes were excised from the immunized animals iind used lor ceiular 

Y^^. Silr*^"*^. "^"^ ^ '^^ ■'" * 2.5:1. and tlM> mixture was incu> 

bated « RPMI medium oontaining 1 g «rf polyethylene glycol (mean MW 4.000) dtesolved in it, for Z ndnutes accoiding 
to a modified meUu>d of KQhter et al., to permit fusion of the cells. Fused cells were seeded In HA-m^ plates with HAT 
«^um c«.ta»*, 10% HCF (Bokusui-Braun; tor 9 days, followed by incubation in HT medlim ar^SS^^jMn 
• • HyMdo^s were cloned by assaying supematanis of weHs showing c«ll jiar gioX>r arrth 

«^^n >IIJ^, ^"^ «^«<»n« anlfcody staining in cultures of human intestinal epithefial cell 

I^I^l^l^^a^!^'^'^^ ^ ^''^'^ were seeded at calculatiKi concentration of 7 

raWsA«all Innwr^ plates, and screened by immunobkitling technique to venfy and Isolaie cUnal hybridoma cell 
strains. Antibodies were purified from culture supernataits of said hybiidbma. 

Example 3 Cell staining 

Human inlesfind epilheB^ room temperature 

tor 15 mfnt^ arri further treated with 0.2% Triton X-100 In PBS at room temperature for 15 miinrtes. After bteddng 

S^^.Tr'i^ ^ ^ ^ '^'^^ ^ "^"^ed for 30 minutes a(^ 

femperaure^Afler a^uent washing. FITWabeled anti^af immunoglobuDn antibody was addcC and incubated for 
Mimmutes at room temperature, followed by washing off unreacted anfibody and examinatfon with a fluorescent micro- 

Results of dwWe immunofluorescent staining with monodonal antibody to the T^elated pictoin ZO-1 and the 
anb*umano«Surfn monoclonal antibody of the present hwention are shown In Ihefigure. As totally the same staining 
pattern as that of ZCh t (reported In the liteiature) was observed, the human protein of the present I t^entfon has prmen 
to be a human homologue of the TJ adhesion molecule ocdudin. •«.Hrov«i 

&™?I94 Express of ocdudin In cerebrovascular cells 

Since rarebialvascUar endothelial cells are thought to have a high electioreeisiant TJ, which lorn the braintiood 
barrier unlike penpheml vascular endothelial cells. I examined the distributicn and expressfon of oxludin in cultured 
SieSr^Li^C;"**"^ ""^"^ e<«*oresistant TJ and c««ured pordne aortic 

as Pnmere 1359-1391 sense strand (Sequence Na 7) and 1692-1721 antisense strand (Sequence Na ffl^from the 
human o«Sirfn DNA sequence (Sequence Na 4). TTw amno add sequence based on analysis cf the coding base 

canine adudins, thus verrfying the fragnteni to be a cONA for poidne ocdudin. 82p^abeled said bagment was used 

ifl^^^Sl"I^t^"!!**^I!'*^ ^ ^ electrophoresed sanple was transfarad onto nitrooellu- 

^tZ^?? wrth the c DMA probe under highly stringent conation, using an RNA isolation Kit 

(Stratagene)^As a result, the ocdudin mRNA showed a strong bawf at about 2.4 M> in PBEC. whcr»s in PAEC only 
a very weak band was noted at ttialposilioa i>i r«cv,. a™y 

" "^"^ compared using arti-mouse ocdudin antibody as a rronodonal antibady 
spedfraByrecognizingmammaSanocdudins and an antibody against ft^ 

anfln«f^rrr^?^iS^!f '^^[^'''^P^ protein asa 

antlgw. and FITC-Iabeled anh-rat IgQ sheep antibody was used for detection of said antibody VWnen eaual oroteln 

phores«. a strong band of ocdudin was detected at about 58KD in PBEC while a considerably wea»«,r band was noted 
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at that poshron in PAEC. On the other hand, there was no appreci^e difference in expression oT ZO-1 between the two 
types of ceits. With immunostaiiifng. PBEC axhSsited a marked ocdudin expression with the same continuous inlercef- 
Mar locafization as Z01, as seen in the immunoblottffig Gti^ty. In PAEC, In contrast, ocdudin was {^carcety detected 
and ZO-1 showed a discontinuous intercellular localization. These resutts suggest that the relatively marked expression 
5 of ocdudin m PBEC i& required for the formation of the Nghly el ectroresistant TJ. and provide evider »ce that oodiidrn is 
the constituent protein of t}>e TJ. 
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SEQUENCE 

SEQ ID NO: 1 . 

s 

SEQUENCE LENGTH: 522 
SEQUENCE TYPE: amino acid 
w TOPOLOGY: linear 

MOLECULE TYPE: protein 
ORIGINAL SOURCE 

15 

QR6ANISM: human (intestinal epithelial cell strain r84) 
SEQUENCE DESCRIPTION 
» Met Ser Ser Arg Pro Leu Glu Ser Pro Pro Pro Tyr Arg Pro Asp GIu 

1 5 10 Hi 

Phe Lys Pro Asn His Tyr Ala Pro Ser Asn Asp He Tyr Gly Gly Glu 

20 25 , 30 

Met His Val Arg Pro Mel Leu Ser Gin Pro Ala Tyr Ser Phe Tyr Pro 

35 40 45 

Glu Asp Glu lie Leu His Phe Tyr Lys Trp Thr Ser Pro Pro Gly I'al 

50 55 60 

He Arg lie Lea Ser Mel Leu lie He Yal Met Cys He Ala He Fhe . 
65 70 75 80 

Ala Cys Va'l, Ala Ser Thr Leu Ala Trp Asp Arg Gly Tyr Gly Thr Ser 

85 90 95 

Leu Leu Gly Gly Ser Val Gly Tyr Pro Tyr Gly Gly Ser Gly Phe Gly 

100 105 110 

Ser Tyr Gly Ser Gly Tyr Gly Tyr Gly Tyr Gly Tyf Gly Tyr Gly Tyr 

115 120 125 

Gly Gly Tyr Thr Asp Pro Arg Ala Ala Lys Gly Phe Met Leu Ala lilst 



35 



40 
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35 



130 135 140 

Ala Ala Phe Cys Phe He Ala Ala Leu Val He Phe Val Thr Ser Val 
145 150 155 160 

He Arg Ser Glu Met Ser Arg Thr Arg Arg Tyr Tyr Leu Ser Val He 

165 170 175 

He Val Ser Ala He Leu Gly He Met Val Phe He Ala Thr He Val 

180 186 190 

Tyr He Met Gly Val Asn Pro Thr Ala Gin Ser Ser Gly Ser Leu Tyr 

195 200 205' 

Gly Ser Gin He Tyr Ala Leu Cys Asn Gin Phe Tyr Thr Pro Ala Ala 

210 215 220 

Thr Gly Leu Tyr Val Asp Gin Tyr Leu Tyr His Tyr Cys Val Val Asp 
225 230 235 ?A0 

Pro Gin Glu Ala He Ala He Val Leu Gly Phe Met He He Val Al-i 

245 250 255 

Phe Ala Leu He He Phe Phe Ala Val Lys Thr Arg Arg Lys Mbt Asd 

260 265 270 

Arg Tyr Asfi Lys Ser Asn He Leu Trp Asp Lys Glii Hijs He Tyr Asj) ■ 

275 280 285 

Glu Gin Pro Pro Asn Val Glu Glu Trp Val Lys Asn Val Ser Ala Gly 

290 295 300 

Thr Gin Asp Val- Pro Ser Pro Pro Set Asp Tyr Val Glu Arg Val Asp 
305 310 315 320 

Ser Pro Met Ala Tyr Ser Ser Asn Gly Lys Val Asn Asp Lys Ara Phe 

325 330 335 

Tyr Pro Glu Ser Ser Tyr Lys Ser Thr Pro Val Pro Glu Val Val Gin 



BS 
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340 345 350 

Glu Leu Pro Leu Thr Ser Pro Val Asp Asp Phe Arg Gin Pro Ar« Tyr 

355 360 365 

Ser Ser Gly Gly Asn Phe Glu Thr Pro Ser Lys Arg Ala Pro Ala Lys 

370 375 380 

Gly Arg Ala Gly Arg Ser Lys Arg Thr Glu Gin Asp His Tyr Glu Thr 
385 390 395 400 

Asp Tyr Thr Thr Gly Gly Glu Ser Cys Asp Glu Leu Glu Glu Asp Frp 

405 410 ' , 4l;i 

He Arg Glu Tyr Pro Pro He Thr Ser Asp Gin Gin Arg Gin Leu fyr 

420 425 430 

Lys Arg Asn Phe Asp Thr Gly Leu Gin Glu Tyr Lys Ser Leu Gin Ser 

435 440 445 

Glu Leu Asp Glu He Asn Lys Glu Leu Ser Arg Leu Asp Lys Glu l.eu 

450 .455 460 

Asp Asp, Tyr Arg Glu Glu Ser Glu Giu Tyr Met Ala Ala Ala Asp Glu 
465 . 47q . .475 . ., , 480 

Tyr Asn Arg; Leu Lys Gin Val Lys. Gly Ser Ala Asp Tyr Lys Ser Lys 

.485 490 , . 495 

Lys Asn His Cys Lys Gin Leu Lys Ser Lys Leu Ser His He Lys lys 

500 505 , .510 

Met Val Gly Asp Tyr Asp Arg Gin Lys Thr 

515 ... .. , 520 , 

SEOUENCE ID NO: 2 
SEQUENCE LENGTH: 521 
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SEQ TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE 

ORGANISM: canine (kidney cell MDCK) 
SEQUENCE DESCRIPTION 

Met Ser Ser Arg Pro Phe Glu Ser Pro Pro Pro Tyr Arg Pro Asp Gli 

1 S 10 15 

Phe Lys Pfo'Asn His Tyr Ala Pro Ser Asn Asp Val Tyr Gly Gly Asp 

■ -20 25 30 

Met His Val Arg Tro Mel Leu Ser Gin Pro Ala Tyr Ser Phe Tyr Pro 

35 40 45 

Glu Asp Glu He Leif Kis Phe Tyr Lys Trp Thr Ser Pro Pro Gly VjiI 

50 55 60 

He Arg He Leu Ser Met Leu Val He Val Met Cys He Ala He Phe 
65 " 70 75 80 

Gly Cys Val Ala Ser Thr Leu Ala, Trp Asp Arg Gly Tyr Gly Thr Gh 
^ y^s., .. : :9p . 95 '* 

Leu Met Gly Gly Ser He Gly Tyir Pro Tyr Gly Ser Gly Ptie Gly Ser 

lOO ' 105 ^ - 116 

Tyr Gly Thr Gly Tyr Gly Tyr Gly Phe Gly Tyr Gly Tyr Gly Tyr Gly 

115 120 125 

Gly Tyr Thr Asp Pro Arg Ala Ala Lys GlV'Phe Leu Leu Ala Met Val 

130 135 140 

Ala Phe Cys Phe He Ala Ala Leu Val He Phe Val Thr Ser Val II2 
145 150 155 [1)0 
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20 



SO 



Arg Ser Asp He Ser Arg Thr Arg Arg Tyr Tyr Leu Thr Val He He 
165 170 175 

180 185 190 

He Met Gly Val Asn Pro Thr Ala Gin Ala Ser Gly Ser Leu Tyf Ser 

195 200: 205 

Ser Gin He Tyr Ala Met Cys Asn Gin Phe Tyr Ala Ser Thr Ala Tir 

210 215 220 

Gly Leu Tyr Met Asp Glji Tyr Leu Tyr His Tyr Cys Val Val Asp Pro 
225 230 .235 240 

Gin Glu Ala He AU He Val Leu Gly Phe; Met Val He Vai- Ala Phe 

245 250 255 :' 

Ala Leu He He Phe Phe Ala Val Lys Thr- Arg Arg. Lys Mel Asp Aig 

260 265 270 

Tyr Asp Lys Ser Asn He Leu Trp Asp Lys Glu His He Tyr Asp Glu 

27& 280 285 

Gin Pro Pro Asn Val Glu Glu Trp Val; Lys Asn Val Ser Ala Gly Thr • 

290 295 300 

Gin Asp Met Pro Pro Pro Pro Ser Asp Tyr Val Glu Arg. Yal Asp. Ser 
305 310 315 ;J25 

Pro Met Ala Tyr Ser Ser Asn Gly Lys Val Asn Asp Lys Arg Leu Tyr 

325 330 335 

Pro Glu Ser Ser Tyr Lys Ser Thr Pro Val Pro Glu Val Val Gin Glti 
340 345 .350 r.. ; 

Leu Pro Ala Thr Ser Pro Ala Asp Asp Phe Arg Gin Pro Arg Tyr S(;r 

355 360 365 

Ser Ser Gly His Leu Glu Pro Pro Ser Lys Arg Ala Pro Ser Lys Gly 
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370 375 380 

Arg Thr Gly Arg Pro Lys Arg Leu Glu Gin Asp His Ty; Glu Thr hp 
385 390 395 400 

Tyr Thr Thr Gly Gly Glu Ser Cys Asp Glu Leu Glu Glu Asp Trp lie 

405 410 415 

Arg Glu Tyr Pro Pro He Thr Ser Asp Gin Cln Arg Gin Leu Tyr Lys 

420 425 430 

Arg Asn Phe Asp Thr Gly Leu Glji Glu Tyr Lys Ser Leu Gin Ala Glu 

435 ; 440 445 

Leu Asp GItf lie Asn Lys Glu Leu Ser Arg Leu Asp Lys Glu L^u A:n» 

450 . 455 ■ 460 

Asp Tyr Arg Glu Glu Ser Glu Glu Tyr Mel Ala Ala Ala Asp Glu Tyr 
465 470 475 480 

Asn Arg Leu Lys Gin Val Lys Gly Ser Pro Asp Tyr Lys Asn Lys Arj: 

485 490 495 

Asn Tyr Cys Lys Gin. Leu Lys Ser Lys Leu Ser His He Lys Lys Met 

500 505 510 

Val Gly A-sp Tyr Asp Arg Gin Lys Thr ■ - ■ . - ^ r-' 

515 520 

SEQ ID NO: 3 ? 
SEQUENCE LENGTH: 521 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE 
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ORGANISM: mouse (lung cell) 
SEQUENCE DESCRIPTION. 

Mel Ser Val Arg Pro Phe Glu Ser Pro Pro Pro Tyr Arg Pro Asp tlu 

15 10 15 

Phe Lys Pro Asn His Tyr Ala Pro Ser Asn Asp Met Tyr Gly Gly Glu 

20 25 30 

Met His Val Arg Pro Met Leu Ser Gin Pro Ala Tyr Ser Phe Tyr Pro 

35 40 45 , 

Glu Asp Glu He Leu His Phe ^Tyr Lys Trp Thr S.er Pro Pro Gly Val 

50 55 60 

lie Arg He Leu Ser Met Leu He He Val Met ,CysJle Ala He Plie 
65 70 ,75 80 

Ala Cys Val Ala Ser Thr Leu Ala Trp Asp Arg Gjly Tyr Gly Thr Gly^ 

85 90 95 

Leu Phe Gly Gly Ser Leu Asn Tyr Pro Tyr Ser Gly Phe Gly Tyr Gly 

100 105 110 

Gly Gly Tyr Gly Gly Gly Tyr Gly Gly Tyr Gly Tyr Gly Tyr Gly Gly 

115 : . 120 125 . 

Tyr Thr Asp Pro Arg Ala Ala Lys Gly Phe Leu, Leu Ala Met Ala Ala . 

130 ' 135 140 

Phe Cys Phe He, Ala Ser Leu Val He Phe Val Tfir Ser Val He Arg . 
145 150 155 150 

Ser Gly Met Ser Arg Thr Arg Arg Tyr Tyr. Leu lie ,yal I|e He Val - 

165 170 175 

Ser Ala He Leu Gly He Met Val Phe He Ala Thr He Val Tyr 11? 
180 185 190 
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Mel Gly Val Asrt Pro Thr Ala Gin Ala Ser Gly Ser Met Tyr Gly Ser 

195 200 205 

Gin He Tyr Mel He Cys Asn Gin Phe Tyr Thr Pro Gly Gly Thr Gly 

ZIO 215 220 

Leu Tyr Val Asp Gin Tyr Leii tyr His Tyr Cys Val Val Asp Pro Gin 
225 230 235 240 

,5 Glu Ala He Ala He Val Leu Gly Phe Met He He Val Ala Phe Ahi 

245 250 255 

Leu He He Phe Phe Ala Val' Lys Thr Arg Arg Lys Mel Asp Arg Tyr 

260 265 270 

Asp Lys Ser Asn lie Leu Trp Asp lys 'Glu His He Tyr Asp Glu CJln ' 

275 280 285 

Pro Pro Asn Val Glu Glu Trp Val Lys Asn Val Ser Ala Gly Thr (jln 

290 295 300 

Asp Met Pro Pro Pro Pro Ser Asp Tyr Ala Glu Arg Val Asp Ser Pro 
305 310 315 ' 3;!0 

35 Met Ala Tyr Ser Ser Asn Gly Lys Val Asn Gly Lys Arg Ser Tyr fro 

325 330 335 

Glu Ser Phe Tyr Lys Ser Thr Pro Leu Val Pro Glu Val Ala Gin Glu 

40 

340 345 350 

He Pro Leii Thr Leu Ser Val Asp Asp Phe Arg Gin Pro Arg Tyr Ser 
45 355 360 365 

Ser Asn Gly ^Asn Leu Glu thr Pro Ser Lys Arg Ala Pro Thr Lys Gly ' 
370 375 380 

60 

lys Ala Gly Lys Gly Lys Arg Thr Asp Pro Asp His Tyr Glu Thr Asp 
385 390 395 400 

55 
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Tyr Thr Thr Gly Gly Glu Ser Cys Glu Glu Leu GIu GIu Asp Tri) Val 

405 410 415 

Arg Glu Tyr Pro Pro He Thr Ser Asp Gin Gin Arg Gin Leu Tyr Lys 

420 425 430 

Arg Asn Phe Asp Ala Gly Leu Gin Glu Tyr Lys Ser Leu Gin Ala GIu 

435 440 445 

Leu Asp Asp Val Asn Lys Glu Leu Ser Arg Leu Asp Lys Glu Leu Asp 

450 455 . 460 

Asp Tyr Arg, Glu Glu Ser Glu Glu Tyr. Met Ala Ala 'Ala Asp Glu Tyr 
465 . 470 , :• 475- " - 4«0 

Asn Arg Leu Lys Gin Val Lys Gly Ser Ala Asp Tyr Lys Ser Lys Arg 
, 485 490 4Jlli ^ 

Asn Tyr Cys Lys Gin Leu Lys Ser Lys Leu Ser His li^ Lys Arg llet 

500 505 510 

Val. Gly Asp Tyr Asp Arg Arg Lys Pro 

515 520 - 

SEQ:ID Np; 4,;. ?< .--.rt ,; . •" 

SEQUENCE mCTH; 2379 

SEQUENCE TYPE: nucleic a?id: ■ ;.C 

STRANDEDNESS: double . , 
TOPOLOGY: linear . 

MOLECULE, TYPE:, cDNA • 
ORIGINAL SOURCE 

ORGANISM: human (intestinal epithelial ceil strain TJ4) 

FEATURE 
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Feature Key: mat peptide 
Feature Table Definition: 168.. 1733 
SEQUENCE DESCRIPTION 

CTCCCGCGTC CACCTCTCCC TCCCTGCTTC CTCTGGCGGA GGCGGCAGGA ACCGAGAG(:(: 60 

AGGTCCAGAG GGCCGAGGAG CCGGTCTAGG ACGCA6CAGA TTGCTHATC nGCAAGCr/i 120 

AAGGGCATTG CTCATCCTGA AGATCAGCTG ACCATTGACA ATCAGCCATG TCATCCAG(:(: 180 

CTCnCAAAG. TCCACCTCCT TACAGGCCTG ATGAATTCAA ACCGAATCAT TATGCACCAA 240 

GCAATGACAT ATATGGTGGA GAGATGCATG TTCGACCAAT GCTCTCTCAG CCAGCCTACl 300 

CTnTTACCG. AGAAGATGAA ATTCTTCACT TCTACAAATG GACGTCTCCT CCAGGAGTGA 360' 

» TTCGGATCCT GTCTATGCTC ATTATTGTGA TGTGCATTGC CATCTtTGCC TGTGTGGCCr 420 

CCACGCTTGC CTGGGACAGA GGCTATGGAA CTTCCCTTTT AGGAGGTAGT' GTAGGCTACC 480 

CTTATGGAGG AAGTGGCHT GGTAGCTACG GAAGTGGCTA TGGCTATGGC TATGGTTATG 540 

GCTATGGCTA GGGAGGCTAT ACAGACCCAA GAGCAGCAAA GGGCtTCATG HGGCCATGG 600 

CTGCCTTTTG THCATTGCC GCGTTGGTGA TCTrTGTTAC CAGTGTTATA AGATCTGAAJi 660 

so TGTCCAGAAC AAGAAGATAC TACHAAGTG TGATAATAGT GAGTGCTATC aGGGCATC/i -720* 

TGGTGTnAT TGCCACAAH GTCTATATAA TGGGAGTGAA CCCAACTGCT CAGTCnCTC; 780 

GATCTCTATA TGCHCACAA ATATATGCCC TCTGCAACCA ATTHATACA CCTGCAGCTA 840 

as 

CTGGACTCTA CGTGGATCAG TAnTGTATC ACTACTGTGT TGTGGATCCC CAGGAGGCCA m- 

TTGCCATTGT ACTGGGCnC ATGATTATTG TGGCTTTTGC THAATAATt TTCTTTGCTG ^60 

TGAAAACTCG AAGAAAGATG GACAGGTATG ACAAGTCCAA TATTHGTGG GACAAGGAAC 1020 

ACATTTATGA TGAGCAGCCC CCCAATGTCG AGGAGTGGGT TAAAAATGTG TGTGCAGGCA 1080 ' 

CACAGGACGT GCCTTCACCC CCATCTGACT ATGTGGAAAG AGHGACAGT CCCATGGCAT 1140 

45 

ACTCTTCCAA TGGCAAAGTG AATGACAAGC GGTnTATCC AGAGTaTCC TATAAATGCA 1200 • 

CGCCGGnCC TGAAGTGGn CAGGAGCTTC CATTAACTTC GCCTGTGGAT GACTTCAGGC 1260 

^ AGCCTCGHA CAGGAGCGGT GGTAACTTTG AGACACCTTC AAAAAGAGCA CGTGCAAAGG 1320 

GAAGAGCAGG AAGGTCAAAG AGAACAGAGC AAGATCACTA TGAGACAGAC TACACAACTG .1380 

ss 
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GCGGCGAGTC CTGTGATGAG CTGGAGGAGG ACTGGATCAG GGAATATCCA CCTATCAC'IT J 440 
CAGATCAACA AAGACAACTG TACAAGAGGA ATTTTGACAC TGGCCTAGAG GAATACA;i(;A 1500 

5 

GCTTACAATC AGAACHGAT GAGATCAATA AAGAACTCTC CCGITTGGAT AAAGAATT(» 1560 
ATGACTATAG AGAAGAAAGT GAAGAGTACA TGGCTGCTGC TGATGAATAC AATAGACTCA 1620 

10 AGCAAGTGAA. GGGATCXGCA GAHACAAAA GTAAGAAGAA TCAnCCAAG CAGnAA/tA 16,80 

GCAAATTGTC ACACATCAAG AAGATGGTTG GAGACTATGA TAGACAGAAA ACATAGAAGG 1740. 
CTGATGCCAA GTTGnTGAG AAATTAACTA TCTGACATCT CTGCAATCTT aWGAAGGC ,1800 
AAATGACm GGACCATAAC CCCGGAAGCC AAACCTCTGT GAGCATCACA AAGTTTT6GT 1860 
TGCTTTAACA.TCATCAGTAT TGAAGCAHT TATAAATCGC TTTTQATAAT CAACTGGGfT 1920. 

so GAACACTCCA AHAAGGATT TTATGCTTTA AACAnGGTT CTTGTATTAA GAATGAAAIA 1980 

CTGTTTGAGG TTTTTAAGCC TTAAAGGAAG GTTCTGGTGT GAACTAAACT TTCACACQCC 2040 . 
AGACGATGTC TTCATACCTA CATGTATTrC TTTGCATAGG TGATCTCAH TAATCCTCT" 2100- 

^ AACCACCTTT CAGATAACTG nATmiAA TCACTTTm CCACATAAGG AAACTeGCff 2I6Q 

CCTGCAATGA AGTCTCTGAA GTGAAACTGC TTGniCCTA GCACACACH TTGGnAAGT 2220 
CTGTTTTATG ACTTCAHAA TAATAAAnC CCTGGCCnT CATATTTTAG CTACTATATiV 2280. 
TGTGATGATC TACCAGCCTC CCTATTTTTT TTCTGTTATA TAAAIGGHA AAAGAGGrr; 2340 
TTCHAAATA ATAAAGATCA TGTAAAAGTA AAAAAAAAA ' • 2379 

35 . . . . ^ 

SEQ ID NO: SV 
SEQUENCE LENGTH: 1961 
SEQUENT fjPE: nucleic acid 
STRANDEDNESS: double ^ 
^. TOPOLOGY: linear 

MOLECULE TYPE: cDNA 
ORIGINAL SOURCE 

so 

0RGAN194: canine (kidney cell strain MDCK) 

6S 
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FEATURE 

Feature Key: mat peptide 
Feature Table Definition: 72.. 1624 
SEQUENCE DESCRIPTION 

CACGnCGCT TATTTTCGGG AGCTCTGGGA TCCTGCTCGT CCTGAAGATC GGGTGATCAT 60 
TGACATCAGC CATGTCATCG AGGCCHTTG AGAGTCCACC TCCGTATAGA CCTGATGAAT 120 
TCAAACCCAA tCATTATGCA CCGAGCAATG ATGTGTACGG TGGGGACATG CACGTCCGAC 180 
CCATGCTCtC TCAGCCGGCG TAnCTTTCT ACCCAGAAGA' TGAAATTCTT CACTTCTACA 240 
AATGGACCtC TCCtCCAGGA GTAATTCGGA TTCTGTCCAT GCncfCAn GTCATGTGCA 300 
rCGCCATATT TGGCTGTGTC GCGTCCACGC TCGCCTCGGA TAGA«;tTAT GGAACTGGCr 360 
TAATGGGtGG TAGCATAGGC tACCCTfACG GAAGTGGdTT CGGGAGCf AC dGGACTGGCT 420 
ACGGCTACGG GTnGGCTAC GGCTACGGCT ACGGCGGCTA CACGGAVcCC AGAGCAGCiU 480 
AGGGCTTCCT CCTGCtCATG efCGCCrrTT GITTlATCGd TGCATTGGTG ATATtTGTTA 540 
CCAGCGTTAT AAGGTCTGAC ATATCCAGAA CCAGAAGGTA CTACnGACT GTAATAATiiC 600 
TGAGTGCCTT CCTGGGCGTC ATGATGTTCA HGCTACAAT TGTCTATATA ATGGGAGTCA 660 
ATCCAACTGC CCAGGCnCT GGGTCTTTAT ACAGTTCACA GATATATGCC ATGTGCAACC 720 
AGHCTATGC ATCTACAGCT ACCGGACTCT ACATGGATCA GTATTTGTAT CACTACTGIG 780 
TGGTGGATCC CCAAGAGGCA ATTGCCATTG TCCTGGGATT CATGGTGAn GTGGCnTKi 840 
CTTTAATAAT TTTCTTTGCT GTGAAAACTC GAAGAAAGAT GGACCGGTAT CACAAGTCGA 900. 
ATATATTGTG GGACAAGGAA CATATTTATG ATGAACAACC CCCCAATGTT GAAGAGTGGG .960 
HAAAAACGT TTCTGCAGGC ACACAAGACA TGCCTCCTCC CCCnCTGAC TATGTGGAGA 1020 
GAGTGGACAG TCCCATGGCG TACTCHCCA ATGGTAAAGT GAATCACAAG CGGTTGTATC 108Q 
CAGAGTCTTC CTATAAATCA ACACCGGTCC CCGAAGTGGT GCAGGACCTG CCCGCCACCT 1140 
CCCCTGCGGA TGACHCAGG CAGCCTCGCT ACAGCAGCAG CGGGCACTTG GAGCCACCTI 1200 
CGAAGAGGGC CCCCTCGAAA GGAAGAACGG GAAGGCCCAA GAGGCTGGAG CAGGACCACT 1260 
ATGAGACAGA CTACACGACG GGCGGCGAGT CGTGTGACGA GCTGGAGGAG GACTGGATCA 1320 
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GGGAATATCC ACCTATCACT TCAGATCAAC AAAGACAACT CTACAAGAGA AATTTTGACA 1380 
CTGGCCTGCA GGAATACAAG AGQIACAAG CA6AACTTGA JGAGATCAAT AAAGAACTCT 1440 
CTCGCCTGGA TAAAGAAHG GATGACTATA GAGAAGAAAG TGAAGAGTAC ATGGCTGdG 1500 
CTGATGAGTA CAATAGACTG AAGCAAGTTA AGGGATCTCC AGATTACAAA AATAAGAGCA 1560 
ATTATTGCAA GCAGHGAAG AGCAAATTGT CCCACATCAA.. GAAGATGGTT GGAGACTAIG 1620 
ATAGACAGAA AACATAGAAG GCAGATGCCA CACAGHTGA GAGATTGTGA AGTATTTGAC 1680 
ATATCTGCAA CGTTGTCAGA AGGCAGAATG ACTTTGGATT TCGAACCCAG GAGGCCAGAT 174Q 
CTTTGTGATC AHACAAAGT TrTGG^fAGCT .TTAATATCAT CAGXATTGAA GCATTTTACA 1800 
CATAGCTTTT GATAATCAAC TGireCTGAAC ACTCCCGATT AAGGATTCTG TGCTTTAGAC; 1860; 
TTTGGCTGTT GTCQTAAAGG ACTGAGTATA GGTGGAGGTT.. TTCAGACCTT GGAAGAAGGF 1920 
CCCACGGTGA ACTTCTGCTG TGAACTTGCA CACnGGGGC A . 1,961- 

SEQ ID NO.;. 6 . . . . .. . 

SEQUENCE LENGTH;, 2839 
SEQUENCE. TYPE: nucleic acid , - 
STRANDEDNE;SS: douWp, V 
TOPOLOGY:, linear, ,. 

MOLECULE: TYPE:. cDNA ■ - . , ... ... ... . , . . . : ,; . . 

ORIGINAL Sj3URCE.. , . .. 

ORGANISM: Bouse (lung eel l>:v , . . 
FEATURE . . •, • • Hi...: - . . '. 

Featuje, Key: mat peptide; = ■ 

FeatHre Table Definition: 223 / 1785 , 
SEQUEN(X DESCRIPTION 

GGAGTTTCAG GTGAATGGGT CACCGAGGGA GGAGGCTGGC CACGCCACAC CTCGTCGCTA 60 
GTGCCCACCT CCCGGCCCCT GTTTCCTTAG GCGACAGGGG TGGAGnCCG GGAGAGCGGT 120 
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CCAGCGCACG GAGCAACCGG QAGGGGCTC GGCAGGTTCG CTTATCnOG GAGCCTGGAC 180 
AniTGCTCA TCATAAAGAf TAGGTGACCA GtGACATCAC CCATGTCCGT GAGGCCTTTT 240 

a 

GAAAGTCGAC CTCCTTACAG ACCTGATGAA nCAAACCCA ATCAtTATGC ACCAAGCAAT 300 
GACATGTATG GCGGAGAGAT GCATGTCCGG CCGATGCTCT CTCAGCCAGC GtACTCTTTT 360 
TATCCGGAAG ATGAAATTCT TCACtrCTAC AAATGGACGT CGCCCCCAGG GGTGATCCGG 420 
ATCCTGTCTA TGCTCATTAt TGTCATGTGC ATCGCCATAT nGCCTGTGT GGCTTCCACA 480 
CTTGCnGGG ACAGAGGCTA TGGGACACdG CTCtTTGGAG GAAGCCTAAA CTACCCTTAT 540 
AGTGGCTTTG GCTACGGAGG TGGCTATGGA GGCGGCTATG GAGGCTATGG CTATGGCTAT 600 
GGCGGATATA CAGAiECCAAG AGcAGGCAAA GGCTTCCTGT TGGCCATGiGc' AGCCTTCiTGC 660' 
^ nCATCGCTT CCTTAGTAAT AmGTGACC-AfiTGTTATAA GATCTGCAAt GTCCAGGACA 720 

AGAAGATAn ACTTGATCGT GATCATAGTC AGCGCTAT CC TGGGCATCAT GGTCmATT 780 
GCCACGATCG TGTACATAAT GGGAGTGAAC CCGACGGCCC AGGCnCTGG ATCTATGTAC 840 

25 

GGCTCACAGA TATATATGAT CTGCAACCAG TlTTATAaC CTGGAGGTAC TGGTCTCTAC 96b 
GTGGATCAAT ATTTGTATCA aACTGTGTG GHGATCCCC AGGAGGCTAT AGCCATTGTC 960 
^ CTGGGGTTCA TGATTATCGT GGCTTTTGCT TTAATCATCT TnTTGCTCT GAAAACCCGA 102O 

AGAAAGATGG ATCGGTATGA TAAGTCCAAT ATTTTGTGGG ATAAGGAACA CATTTATGAT lOSt' 
GAACAGCCCC CCAATGHGA AGAGTGGGTT AAAAATGTGT CTGCAGGCAC ACAGGACATG 1140 

as 

CCTCCACCCC CATCTGACTA TGCGGAAAGA GTTGACAGTC CAATGGCCTA CTCCTtCAAT' 1200 
GGCAAAGTGA ATGGCAAGCG ATCATACCCA GAGTCTTTCT ATAAGTCAAC ACCTCTGGTG'' 1260 
^ CCTGAAGTGG' CCCAGGAGAT TCCTCTGACC nGAGTGTGG'ATGAETTCAG GGAGCCTCGG 1320 

TACACCAGCA ATGGTAACCT AGAGACACCT TCTAAAAGGG CTCCCACGAA GGGGAAAGCA 1380 
GGAAAGGGCA AGAGGACGGA CCCTGACCAC TATGAAACAG ACTACACGAG AGGTGGGGAG 1440 

45 

TCCTGCGAGG AGCTGGAGGA GGACTGGGTC AGGGAATATG cacctatcac ttgagatcaajisoo 
CAAAGACAAC TCTACAAGAG AAATTTTGAT GCAGGTCTGC AGGAGTATAA GAGCTTACAG 1560 
" GCAGAACTAG ACCACGTCAA TAAAGACaC TCTCGTCTAG ATAAAGAGCT GGATGACTAC 1320 

AGAGAGGACA GTGAAGAGTA CATGGCTGCT GCTGATGAAT ATAATAGACT AAAGCAAGTT 1 380 

S5 
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AAGGGATCTG CAGATTATAA AAGTAAGAGG AAHACTGCA AGCAGTTGAA. GAGCAA/IITA 1740 
TCGCACATCA AGAGGATGGT GGGAGACTAT GACAGACGGA AAGCTTAGAG AGATGCCAGT 1800 

5 

TGCGGGAGAA GGGAGAGGTG CATCTGCCTG CACGATGTCT CTGCAAnCT CTCCAGAGGC. 1860 
AAACTGACTT TGGACTCTAA TCTGGGAAGT TAAAACTHG TGATCATTAC AAAGTTTCCA 1920 

,0 TCGCTHAAT TCCATCAGn TCCTATCTCC AGTAHGAAG CATTnATAA ATGCCTTrTG 1980 

ATAATTGACT GGGCTGAACA CTCCAATTAA GGATTnACA GTTTCAACAT TGATTCTTST 2040 
AnAAGAATT AAAATGnCC TTGAGGTTTT AAATGTCAAG AAAGGTCCTG GTGTGAGCTG. 2100 
TGATGTGTGT GAGCTGTGAT GTGAAGGTTC ACACGCCAGG CAGCGTGTTC CT^CAGGTi\G.2160 
ACCGTCTAAT CAATCnTGC AGCAGCCCTC AGGTGACTGT TATTOGAAT CAGGTTGrTT 2220 

^ nCGnTTCC AGACAGGCn TCTCTGTGTA GCCCTGGCTG ACCTAGAACT TACGCTGTi^G 2280 

ACCAGGCTGG CCnGAACTC ACACAGCTCC TCTGAGTGCT GCTGCAGGAG TTAACGTCGT 2340 
GGACCGGTAT CATCACTTn CCTGCGGTGA CTTCTCCAAA CTGAAACTGC TAAGGCAC1T 2400 
TTGGCTAAGT CTCmTATG ACTGCAAATG ACAGCATTCC TGCCnTGTA TTTCAGGG(A 2460 
AATACGATAC ATTATATCGG CCATGTTCCC CACCACTGTT TTTCTTATAT TGACTTTTM 2520 

* CAAATGAATA. GGAnATHT IGGCTTTACA TTTTTTCCTA ACACTTAAGA TCATATAA/\A 2580 

HAACAAATA TGTGAAATn AAGAAHGTA AATATATATT TACGTTTGAA AGATGATTTT 2640 
AAATCCAGGG TTAAAGTGCT THTATCTTG TATAGITTAC ATGCnnTT TTTTnTTGA 2700 
TAACCCACTA GACCHTCCA TTGTATCAGA GTATCCAATT ACAHTACAA TTATGACTrC 2760 
AATTGTAm CACAGGAATG CTCAAGTTTT GTACATATTT TATAAGGTAT TAAACCTGAF 2820 
GnCTCTTTC TAAAAAAAA 2839 

SEQ ID NO: 7 

45 

SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
" STRANDEDNESS: single 

TOPOLOGY: linear 

ss 
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MOLECULE TYPE: Other nucleic acid Synthesis DNA 

SEQUENCE DESCRIPTION 

TATGAGACAG ACTACACAAC TGGCGGCGAG TCC 

SEQUENCE ID NO: 8 ' 
SEQUENCE LENGTH: 30 
SEQBl^NCE TYPE : nuc I e i c ac i d ' ^ 
STRAKbfiDliESS: single 
TOPOLOGY: linear ' 

MOLEClIi^E TYPE; 6ther nucleic acid Synthesis DNA 
SEQUENCE DESCRIPTION 



^ ATCATAGTCT CCAACCATCT TCnCATCtC 



40 



24 
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SEQUENCE LISTING 



5 il) GENERAL INPORMATIOK: 

(i) APPLICANT: 

(A) NAME: Bisai Co. Ltd. 

(B) STREET: 6-10, Koishikawa, 6-chome, Bxinkyo-ku 

(C) CITY: Tokyo 

,0 (B) COUNTRY: Japan 

(P) POSTAL CODE (ZIP) : none 

(ii) TITLE OF INVENTION: Human aoUiesion molecule occludin 

(iii) NUMBER OP SEQCENCES; 8 

(iv) COMPUTER READABLE FORM: 

(A) MEDICM TYPE: Floppy disk 
^B> COMPUTER: IBM PC cotnpatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1,0, Version #1.30 (KPO) 

20 (v) CURRENT APPLICATION DATA: 

APPLICATION mmBR: EP 97 905 440.0 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 49 880/96 

(B) FILING DATE: 07-MAR-1996 

(vi) PRIOR APPLICATION DATA: 
^ (A) APPLICATION NUMBER: JP 331 944/96 

(B) FI-IyING DATE: 12-DBC-.1996 

(2) INFORMATION FOR SEQ ID NO: 1: 

30 (i) SEQUENCE CHARACTERISTICS; 

(A) . LENGTH: 522 amino acids- . 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : 

(D) TOPOLOGY: Linear . ' ,. 

3S (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) 0RC3ANZSM: Homo sapiens 

(Bj^ STRAIN: human intestinal epithelialp cell . , 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: Is 

Met Ser Ser Arg Pro Leu Glu Ser Pro Pro Pro Tyr Arg Pro Imp, Glu 
1 - 5 10 . 

45 Phe liys Pro Asn His Tyr Ala Pro Ser Asn Asp lie. Tyr Gly Cu.y Glu, 

20 '< 25 30 

Met His Val Arg Pro Met Leu Ser Gin Pro .Ala Tyr Ser Phe 'i^T Pro , 
35 40 45 



SO 



Glu Asp Glu lie Leu His Phe lyr Lys Trp Thr Ser Pro Pro Oly Val 

50 55 60 

He Arg He Leu Ser Met Leu He He val Met Cys He Ala IJ.e Phe 
65 70 .75 80 



66 



25 
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Ala Cys Val Ala Ser Thr Leu Ala Trp Asp Arg Gly Tyr Gly Thr Ser 
85 90 95 

5 Leu Leu Gly Gly Ser Val Gly Tyr Pro Tyr Gly Gly Ser Gly Phe Gly 

100 105 110 

Ser Tyr Gly Ser Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Tyr Gly Tyr 
115 120 125 

Gly Gly Tyr Thr Asp Pro Arg Ala Ala Lys Gly Phe Met Leu Ala Met 
130 135 ^ 140 

Ala Ala Phe Cys Phe lie Ala Ala Leu Val lie Phe Val Thr Ser Val 
145 150 155 ISD 

lie Arg Ser Glu Met Ser Arg Thr Arg Arg Tyr "lyr Leu Ser Val lis 
75 165 170 175 

lie Val Ser Ala He Leu Gly He Met. Val Phe He Ala Thr Ilfr Val 
180 185 ;V 'V 190 

Tyr He Met Gly Val Asn Pro Thr Ala Gin Ser Ser Gly Ser Leu T^- 
^ 195 200 205 

Gly Ser Gin He Tyr Ala Leu Cys Asn Gin Phe Tyr Thr Pro Ala ALa 
210 215 .220. . 



25 



30 



Thr Gly Leu Tyr Val Asp Gin Tyr Leu Tyr His Tyr Cys Val Val A!?]3 
225 230 235 2-10 

Pro Gin Glu Ala He Ala He Val Leu Gly 'Phe Met Ile~ 1I0. val Ala 
245 250 255 

Phe Ala Leu He He Phe Phe Ala Val Lys Thr Arg Arg Lys Met Atip 
260 265 270 

Arg Tyr Asp Lys Ser Asn He Leu Trp Asp Lya Glu His He lyr Aiip 
275 2B0 285 

Glu Gin Pro Pro Asn Val Glu Glu Trp Val Lyai Asn Val Ser Ala; Gl.y 
290 295 300 

^ Thr Gin Asp Val Pro Ser Pro Pro Ser Asp Tyr Val Glu Arg Val Aiip 

300 310 31S ; _ -3;i(i 

V * 'u..- ■ • . \:. 

Ser Pro Met Ala Tyr S^r Ser Asn Gly Lys' Val Asn Asp Lys .Arg Phe 
325 330 335 

40 Tyr Pro Glu Ser Ser Tyr Lys Ser Thr Pro Val Pro Glu Val Val Olri 

340 ^^345 . .1 350 . 

Glu' Leu Pro Leu Thr ^Ser Pro Val Tlsp Asp Phe Arg Gin Pro Arg T\t 
355 360 365 

^ Ser Ser (Sly Gly ^ Asn "^Pha Glu Thr Pro Ser : Lya Arg Ala Pro Ala L\-6 

370 375 ^ 380 

Gly Arg Ala Gly Arg Ser Lys Arg The Glu Gin Asp His Tyr Glu Thi ■■■ 
3B5 390 395 4CC 

Asp Tyr Thr Thr Gly Gly Glu Ser Cys Asp Glu Leu Glu Glu Asp Tip 
405 410 415 

He Arg Glu Tyr Pro Pro He Thr Ser Asp Gin Gin Arg Gin Leu Tyi 
420 425 430 

SS 
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Lys Arg Asn Phe Asp Thr Gly Leu Gin Glu Tyr Lya Ser Leu Gin Ser 
435 440 445 

Glu Leu Asp Glu He Asn Lys Glu Leu Ser Arg Leu Asp Lys Gli Leu 
450 455 460 

Asp Asp Tyr Arg Glu Glu Ser Glu Glu Tyr. Met Ala Ala Ala Asia Glu 
465 470 475 460 

Tyr Asn Arg Leu Lys Gin Val Lys Gly Ser Ala Asp Tyr Lys Ser Lvs 
^0 465 490 / 49!; 

Lya Asn Hia Cys Lys Gin Leu Lys Ser Lys Leu Ser His , He Ly!> Lys 
50O 505 510 

Met Val Gly Asp Tyr Asp Arg Gin Lys Thr 
IS 515 520 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 521 amino acids 
[B) TYPE: amino acid ' - ' 

^ (C) STRANDEDNESS: 

(P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE; 
25 (A) ORGANISM: canine 

(B) STRAIN: canine 3tidney cell MDCK , 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



30 


Met 
1 


Ser Ser 


Arg 


Pro Phe Glu Ser. Pro Pro Pro Tyr Arg Pro Aspi Glu 

5- 10 15.; ' 




Phe 


Lys Pro 


Asn 
20 


His Tyr Ala Prp Ser Asn Asp Val Tyr Gly Glv; Asp 
25 30 


35 


Met 


Hj.8. Val 
, 35 


Arg 


Pro Met Leu Ser Gin Pro Ala Tyr Ser phe Tyi. Pro 
-.40 " 45 




Glu 


Asp- Glu 
50 


lie 


Leu His Phe Tyr Lya Tcp,Thr Ser Pro Pro, Gly , Val 

55' .. ■ . ' ' 60' . " ^" ■ ' 


40 


He 

03 


Arg lie 


Leu 


Ser .Met«Leu Val He Val Met Cys Hp Ala lie Phe 

70 ' 75 ' ^ ' ■ [ ' 




Gly 


Cys 5 Val 


Ala 


Ser Thr Leu Ala Trp Asp Arg Gly Tyr Gly Thr Gly 

B5. . 90;.. 95 , 


45 


Leu 


Met, Gly 


Gly 
100* 


Ser^ lie qiy Tyr Pro Tyr Gly- Ser Gly Phe Gly Ser 
105 iio ' 




Tyr 


Gly Thr 
115 


Gly 


Tyr Gly Tyr Gly Phe, Gly TVr- Gly Tyr . Gly, Tyr Gly 
120 "125 ■■■^•'■'^ 


SO 


Gly 


lyr. nir 
130' 


Asp 


Pro Arg Ala Ala Lys Gly Phe Leu Leu Ala Met Val 
135 140 




Ala 
145 


Phe Cys 


Phe 


lie Ala= Ala Leu Val He Phe Val Thr Ser Val lie 
150 155 160 



ss 
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Arg Ser Asp lie Ser Arg Thr Arg Arg Tyr Tyr Leu Thr Val He He 
165 170 175 

Leu Set Ala Phe Leu Qly Val Met* Met Phe He Ala Thr He Val Tyr 
IBO 185 190 

He Met Gly Val Asn Pro Thr Ala Gin Ala Set Gly Ser Leu Tyr Ser 
195 200 205 



10 



Ser Gin He Tyr Ala Met Cys Asn Gin Phe Tyr Ala Ser Thr Ala Thr 
210 ' 215 220 



15 



Gly Leu Tyr Met Asp Glh Tyr Leu Tyr His Tyr Cys Val Val Asp Pr^ 
225 230 235 240 

Gin Glu Ala He Ala He Val Leu Gly Phe Met Val He Val Ala Phia 
245 250 255 

Ala Leu He He Phe Phe Ala Val Lys Thr Arg Arg Lys Met Asp Ac? 
2€0 265 270 

Tyr Asp Lya Ser Asn He Leu Trp Asp Lys Glu His lie Tyr Asp GLu 
275 280 285 

Gin Pro Pro Asn Val Glu Glu Trp Val Lys Asn Val Ser Ala Gly Tic 
290 295 300 



26 



Gin Asp Met Pro Pro Pro Pro Ser Asp Tyr Val Glu Arg Val Asp Sisr 
305 310 315 3:20 

Pro Met Ala Tyr Ser Ser Asn Gly Lys Val Aisn Asp Lys Afgf- LeU Tfc 

325 330 335 



Pro Glu Ser Ser Tyr Lys Ser Thr Pro Val Pro Glu Val Val Gin Glu 
340 345 3Sa ' 

Leu Pro Ala Thr Ser Pro Ala Asp Asp Phe Arg Gin Pro Tixg Tyr Si}-: 

355 360 365 

Ser Seir Gly His Leu Glu Pro Pro Set Lys Arg Ala- Pro Ser Lys Gly 
370 375 380 

Arg Thr Gly Ar'J Pr6 Lys Ar^ Leii Glii Gin- Asp His "Tyr Glu Thr- Anii' 

365 390 393 . 4l><^ 

Tyr Thx Tht Oly ciy Glii Ser Cys Asp Glu Leu Glu Glii Asp Trp He 
405 410 415 

Arg Glu Tyt Pro Pro He Thr Ser Asp Gin Gin Arg Gin Leu- Tyr L^'fr 
420 425 430 

Arg Asn Phe Asp Thr Gly Leu Glri Glu Tyr Lya Ser Leu Gin Ala^ G].ii 

435 440 44$ 

Leu Asp Glii lie Asn Lys GlU Leu Ser Arg Leu Asp Lys Glu Leu- At^{^ 
450 ^ 455 460 

Asp Tyr Arg Glu Glu Ser Glu Glu Tyr Met Ala Ala Alfii Asp Glii Tjt 
465 470 475 4C C 

Asn Arg Leu Lys Glfi val Lys Gly Ser Pro Asp Tyr Lys Asn Lys Aic 
485 490 495 

Asn Tyr Cys Lys Gin Leu Lys Ser Lys LeU Ser His He Lys Lys Met 
500 505 510 



68 
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IS 



20 



Val Gly Asp Tyr Asp Arg Gin Lys Thr 
5X5 520 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH:- 521 amino acids 

(B) TYPE: amino acid 

(C) STRANDGDNBSS : 

(D) TOPOIjOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(B) STRAIN: mouse lung^cell 



Cxi) SEQUENCE DESCRIPTION: SSQ ID NO: 3: 

Met Ser- Val Arg Pro Phe 01 u Ser Pro. Pro Pro Tyr Arg Pro Asp ^Glvi 
1 5 .10 :l:5 

Phe Lys\'Pro ASn His Tyr Ala. Pro Ser. Asn Asp Met Tyr Gly Gly Glu 
20 25 30 

Met' Hi 3 Val Arg Pro Met Leu^ Ser. Gln^ Pro Ala Tyr Ser Phe T/r Pro 
35 40 45 

Glu Asp Glu He Leu His Phe Tyr Ly& Trp Thr Ser Pi;© Pro -Sly Val 
50 55; 60' 

He Arg He Leu Ser Met Leu He He Val Met cys He Ala He Phe 
65 70 75 80 

30 

Ala Cys Val Ala Ser Thr Leti Ala Trp Asp Arg Gly Tyr Gly Thr Gly 

.85 '90 , 95 

Leu Phe Gly Gly Ser Leu Aan Tyr Pi?<5f Tyr Ser <3ly Phe Gly Tyrr Gly- 
100 105 110 

^ Gly. Gly Tyr Gly Gly Gly Tyr Gly Oly Tyr Gly Tyr Gly Tyr Gly-Gly - 

115 120 ' X25 

lyr ' Thr Asp Pro' Arg Al^ Ala xya Gly Phe Leu Leu Ala Met aia Ala '= 
13b 135 140' 

^ Phe ty^ Phe ip.e Ala Ser Leu Val* Tie Phe Val -Thf S&r Val ' He Arg 

145 150 155 160 

Ser Gly Met Ser Arg Thr Arg Arg Tyr Tyr Leu He Val He He Val 
165 170 175 

4S Ser Ala lie Leu Gly He Met Val Phe lie Ala Thr He Val 7yr lie 

180 185 . 190 

Met Gly Val Asn Pro Thr Ala Gin Ala Ser -Gly Ser Met Tyr Gly Ser 
195 200 205 

„ Gin He Tyr Met He Cys Asn Gin Phe Tyr Thr Pro Gly Gly Ihr Gly 

210 215 220 

Leu Tyr Val Asp Gin Tyr Leu Tyr His Tyr Cys Val Val Asp Iro Gin 
225 230 235 240 



ss 
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Glu Ala lie Ala lie Val Leu Gly Phe Met He He Val Ala Phe \la 
245 250 255 

Leu He He Phe Phe Ala Val Lys Thr Arg Arg Lyg Met Asp Arg Tvr 
^ 260 265 270 

Asp Lya Ser Asn. He Leu Trp Asp Lys Glu His He Tyr Asp Glu (51n 
275 2B0 2B5 

Pro Pro Asn Val Glu Glu Trp Val Lys Asn Val Ser Ala Gly Thr (31n 
10 290 295 300 

Asp Met Pro Pro Pro Pro Ser Asp Tyr Ala Glu Arg Val Asp Ser l^ro 
305 310 315 :t20 

Met Ala Tyr Ser Ser Asn Gly Lya Val Asn Gly Lys Arg Ser Tyr Pro 
IS 325 330 335 

Glu Ser Phe Tyr Lys Ser Thr Pro Leu Val Pro Glu Val Ala Gin CIlu 
340 345 : 350* 

He Pr<3 Leu Thr Leu Ser Val; Asp Asp^ Phe Arg Gin Pro Arg Tyr £;er 
355 360 , 365 

SO , 

Ser Asn Gly Asn Leu Glu < Thr Pro Ser Lya Arg Ala Pro Thr Lys c;iv 
370 375 380 
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Lys Ala Gly Lys Gly Lys Arg Thr Asp Pro Asp His Tyr Glu Thr 
3B5 390 395 <,00 

Tyr Thr Thr Gly Gly Glu Ser Cys Glu Glu Leu Glu Glu Asp Trp Val: 
405 410 , 415 

Arg Glu Tyr Pro Pro He Thr Ser Asp Gin Gin Arg Gin Leu Tyr Lys 
420 425 430 

Arg Asn Phe Asp Ala . Gly Leu Gin Glu Tyr Lys Ser Leu Gin Ala C!lu 
435 440 445 

Leu Asp Asp: Val Asn Lys Glu LeiJ^ Ser Arg Leu> Asp Lys Glu Leu /so 
450 455 460 

35 Asp .Tyr Arg Glu Gly Ser: Glu jGlu Tyr Met Ala Ala Ala Asp. Glu IVr 

465 .470 475 ' 480 

Asn Arg Leu Lys Gin V&l Lys .Gly Ser, Ala Asp Tyr Lya Ser Lys ilxg 
485 490 495 



40 



Asn Tyr Cye Lyg Gin Leu Lys "Ser -Lys Leu Ser His He Lys Arg ^:et 
500 505 510 

Val Gly Asp Tyr Asp Arg Arg Lya Pro 
515 520 

(2) INFOHMATIOM. FOR SEQ LD HQ: 4? 

(i) SEQUENCE CHARACTERISTICS: 

(A^ L^GTHr 2379 base pairs 

(B) TYPE: nucleic acxd 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vij . ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(B) STRAIN: human intestinal epithelial cell strain T84 

(ix) FEATURE: 
5 (A) NAME/KEY: matjpeptide 

(B) LOCATION: 168. .1733 

(xi) SEQUENCE DESCRIPTI<»I: SEQ ID NO: 4: 

10 CTCCCGCGTC CACCTCTCCC TCCCTGCTTC CTCTGGCGGA GGCGGCAGGA ACCGAGAGCC 60 

AGGTCCAGAG CGCCGAGGAG CCGGTCTAGG ACGCAGCAGA TTGGTTTATC TTGGAAGa*. 120 

AAGGGCATTG CTCATGCTGA AGATCAGCTG ACCATTGACA ATCAGCCATG TCATCCAGCfC: • IBO 

CTCTTGAAAG TCCACCTCCT. TACAGGCCTG ATGAATTCAA ACC6AATCAT TATGCACC;Jt 240 

GCAATGACAT ATATGGTGGA GAGATGCATG TTCGACCAAT GCTCTCTCAG CCAGCCTACr::' 300 

CTTTTTACCC AGAAGATQAA ATTCTTCACT TCTACAAATG. GACCTCTCCT CCAGGAGTiyi 360 

TTCGGATCCT GtCTATGCTC ATTATTGTGA TGTGGATT6C- CATCTTTGCC TGTGTGGCtr.i: 420 

CCACGCTTGC CTGGGACAfiA GGCTATGGAA CTTCCCnTT AGSAGGTAGT GTAGGCTAO: , 480 

CTTATGGAGG AAGTOGCTTT GGTAGCTACG OAAGTGGCTA TGGCTATGGC TATGGTTA'PS 540 

GCTATGGCTA CGGAGGCTAT ACAGACCCAA GAGCAGCAAA GGGCTTCATG TTGGCCATUS 600 

CTGCCTTTTG TTTCATTGCC GCGTTGGTGA TCTTTGTTAC CAGTGTTATA AGATCTGAW 660 

TGTCCAGAAC AAGAAOATAC TACTTAAGTG TGATAATAOT GAGTdCTATC CTGGGCAT2\ 720 

TGGTGTTTAT TGCCACAATT GTCTATATAA TGGGAGTGAA CCCAACTGCT CAGTCTTCrS 780 

GATCTCTATA TGGTTCACAA ATATATGCCC TCTGCAACCA ATTTTATACA CCTGCAGCr^ 840 

CTGGACTCTA CGTGGATCAG TATTTGTATC ACTACTGTGT TGTGQATCCC CAGGAGGCCA- - 900 

TTGCCATTGT ACTGGGGTTC ATGATTATTG TGGCTTTTGC TITAATAATT TTCTTTGCrG : 960 

TGAAAACTC6 AAGAAAGATS GACAGCTATG ACAAGTCCAA TATTTTGTGG GACAAGGARC 1020 

AGATTTATGA TGAGCAGCCC CCCAATOTCG AGGAGTGGGT TAAAAATOTC TCTGCAGGCA 1080 

CACAGGACGT GCCTTCACCC CCATCTGACT ATGTGGAAAG AGTTGACAGT CCCATGGCflT 1140 

ACTCTTCCAA TGGCAAA6TG AATGACAAGC GGTriTATCC AGAGTCTTCC TATAAATCCA 1200 

CGCCGGTTCC TGAAGTGGTT CAGGAGCTTC CATTAACTTC GCCTGTGGAT 6ACTTCAGG:C 1260 

AGCCTCGTTA CAGCAGCGGT GGTAACTTTQ AGACACCITC AAAAAGAGGA CCTGCAA/(GfG 1320 

GAAGAGCAGG AAGGTCAAAG AGAACAGAGC AAGATCACTA TGAQACAQAC TACACAACniG 13 BO 

GCGGCGi\GTC CTGTGATQAG CTGGAGGAGQ ACTGGATCAG GGAATATCCA CCTATCACnT 1440 

CAGATCAACA AA6ACAACT0 TACAAGAGGA ATTTTGACAC T6GCCTACAG GAATACAiifJA 1500 

GCTTACAATC AGAACTTGAT GAGATCAATA AAGAACTCTC CCGOTTGGAT AAAGAAT^XJG 1S60 

ATGACTATAG AGAAGAAAGT OAAGAQTACA TGGCTGCTGC TGATGAATAC AATAGACXiA 1620 

AGCAAGTGAA GGGATCTGCA GATTACAAAA GTAAGAAGAA TCATTOCAAG CAGTTAAiWIA 1680 
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ACATAGAAGG 
CTCA6AAGGC 
AAGTnTGGT 
CAACTGGGCT 
GAATGAAATA 
TTCACACCCC 
TAATCCTCTC 
AAACTGGGTr 
TTGGTTAAGT 
CTAGTATATA 
AAAGAGGTTT. 



GCAAATTGTC ACACATCAAG AAGAT6GTTG GAGACTAT6A TAGACAGAAA 
CTGATGCCAA GTTGTTTGAG AAATTAAGTA TCTGACATCT CTGCAATCTT 
AAATGACTTT GGACCATAAC CCCGGAAGCC AAACCTCTGT 6AGCATCACA 
TGCTTTAACA TCATCAGTAT TGAAGCATTT TATAAATC6C TTTTGATAAT 
GAACACTCCA ATTAAGGATT TTATGCTTTA AACATTGGTT CTTGTATTAA 
CTGTTTGAGG TTTTTAAGCC TTAAAGGAAG GTTCTGGTST GAACTAAACT 
AGACGATGTC TTCATACCTA CATGTATTTG TTTGCATAGG TGATCTCATT 
AACCACCTTT CAGATAACTG TTAITTATAA TCACTTTTTT CCACATAAGG 
CCTGCAAT6A AGTCTCTGAA GTGAAACTGC TTGTTTCCTA GCACACACTT 
CTGTTTTATG ACTTCATTAA TAATAAATTC CCTGGCCTTT CATATTTTAO 
TGTGATGATC TAGCAGGCTC CCTATTTTTT TTCTGnTATA TAAATGGTTA 
TTCTTAAATA ATAAAGATCA - TGTAAAAGTA AAAAAAK^A. . • 

[2) INFORMATIOW FOR SEQ ID HO.: 5i ~ ' 

(i) SEQUBKCE CHARACTERISTICS: 

(A) LENGTH: 1961 base pairs 

(B) TYPE: nucleic. acid 

(C) STRANDEDNESS : double 

(D) * TOPOLOGY: lineaTr . . 

(ii) MOLECULE TYPE; cDKA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: canine 
- i- (B) STRAIN; canine kidney cell strain MDCK 

• (ix) FEATURE: - 

(A) tlAMB/KBY: mat jpeptide 

(B) LOCATION : 72.. 1624 ' i 

(xl) SEQaENCK DESCRIPTirai: SEQ ID NO: 5: 
CAGGTTGQCT TATITTGGGG AGCTCTGGQA TCCTGCTCGT CCTGAAGATC GGGTGATCAT 
TGACATCAGC CATGTCATCG AGGCCTTTTG AGAGTCCACC TCCGTATAGA CCTGATGAAT 
TCAAACCCAA TCATTATGCA CCGAGCAKPG ATGTGTACGG TGGGGACATG CACGTCCGAC 
CCATGCTCTC TCA6CCGGCG TATTCTTTCT ACCCAGAAGA TGAAATTCTT CACTTCTACA 
AATGGACCTC TCCTCCAGGA GTAATTCGGA TTCTOTCCAT GCTTGTCATT GTGATGTGCA 
TCGCCATATT TGGCTGTGTC GCGTCCACGC TCGCCTGGGA TAGAGGCTAT GGAACTGGCT 
TAATGGGTGG TAGCATAGGC TACCCTTACG GAAGTGGCTT CGGOAOCTAC GGGACTGGCT 
ACOGCTACGG GrTTGGCTAC GGCTACGGCT ACGGCGGCTA CACGGATCCC AGAGCAGCAA 
AGGGCTTCCT CCTGGCCATG GTGGCCTTTT GTTTTATCGC TGCATTGGTG ATATTTGTTA 
CCAGCGTTAT AAGGTCTGAC ATATCCAGAA CCAGAAGGTA CTACTTGACT GTAATAATAC 
TGAGIXSCCTT CCTGGGCGTC ATGATGTTCA TTGCTACAAT TGTCTATATA ATGGGAGTCA 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220; 
2280 
.2340 
: 2379 



* '60- 
120 
180 
240 
300 
360 
420 
480 
540 
60O 
660 
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ATCCAACTGC CCAGGCTTCT GGGTCTTTAT ACAGTTCACA GATATATGCC ATGTCCAACt: 720 

AGTTCTATGC ATCTACAGCT ACCX3GACTCT ACATGGATCA GTATTTGTAT CACTACTG^i^i 780 

S TGGTGGATCC CCAAGAGGCJl ATTGCCATTG TCCTGGGATT CATGGTGATT GTGGCTTT^XS 840 

CTTTAATAAT TTTCTTTGCT GTGAAAACTC GAAGAAAQAT CK3ACCGGTAT GACAAGTC(3J!k 900 

ATATATT6T6 GGACAAGGAA CATATTTATG ATGAACAACC CCCCAATGTT GAAGAGTG(K; 960 

^0 TTAAAAACGT TTCTGCAGQC ACACAAGACA TGCCTCCTCC CCCTTCTGAC TATqTGGA(5i^ 1020 

gagtggacag tcccatggcg tactcttcca atggtaaagt gaatgacaag cggttgta'w: ibeo 

CAGAGTCTTC CTATAAATCA ACACCGGTCC COGAAGTGGT QCAGGAOCTG CCCGqCACCP - 1140 

CCCCTGCGGA TGACTTCAQG CAGCCTCGCT ACAGCAGCAG CGGGCACTTG GAGCCACCrr , 12 00 

CGAAGAGGGC CCCCTCGAAA QGAAGAACGG GAAGGCCCAA GAGGCTC3GAG CAGGACCAirr 1260 

ATGAGACAGA CTACACGACq GGCGGCGAGT CGTGTGACGA GCTGGAGGAG QACTGGATIV 1320 

GGGAATATCC ACCTATCACT TCAGATCAAC AAAGACAACT CTACAAGAGA AATTTTGAf-'V 1380 

20 . . - 

CTGGCCTGCA GGAATACAAG AGCTTACAAG CAGAACTTGA TGAGATCAAT AAAGAACTITr 1440 

CTCGCCTGGA TAAAGAATTO GATGACTATA GAGAAGAAAG TGAAGAGTAC ATCqCTGCrS 1500 

CTGATGAGTA CAATAGACTG ^AAGCAAGTTA AGGGATCTCC AGATTACAT^ AATAAGAG3A 1560 

ATTATTOCAA GCAGTTGAAG AGCAAATTGT CXXT^CATCAA GAAGATGGTT^ GGAGACTAPG;, 1620 

ATAGACAGAA AACATAGAAG GCAGATGCCA CakCAGTITGA GAGATTGTGA AfiTATTTGftC .1680 

ATATCTGCAA CGTTGTCAGA AGGCAGAATG ACTTTGGATT TCGAACCCAG GA6GCCAGRT 1740 

30 CTTTGTGATC ATTACAAAGT TTTGGTAOCT TTAATATCAT C^AGTATTGAA GCATTTTACA 1800 

CATAGCTTTT GATAATCAAC TGGGCTGAAC ACTCCCGATT AAGGATOn^ 1850. 

TTTGGCTGTT GTGCTAAAGG ACTQAGTATA GGTGGAQGTT TTCAGACCTT GQAAGAAGGT , 1920 

3S CCCTICGGTCA ACTT(rn3C^X3 TG^ 1961 

[2) INFORMATION Fp|l SEQ 19, NQ: 6;. . . - , , : : „ ; 

(i) SEQUENCE CHARACTERISTICS I, 

CA) LENGTH: 2839 base pairs 
. (B) TYPE: nucleic acid 
^ (C) STRAfTOEDNESS : double 

(D) TOPOIjOGY: linear . , ^ 

(ii). MOLECULE TYPE! cDNA 

(vi) . ORIGINAL SOURCE r. . 
45 (A) ORGANISM: onouse 

(B) STRAIN: mou^e lung cell 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 
<B) LOCATION:223..1785 

60 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 



55 



33 



Received from < > at 5/1/02 2:57:17 PM [Eastern Daylight Time] 



05/01/2002 11:47 FAX 



SEED IP LAW GROUP PLLC 



@|006 



EP0 831 148 A1 



15 



20 



25 



SO 



36 



GGAGTTTCAG GTGAATGGGT CACCGAGGGA GGAGGCTCGC CACGCCACAC CJtCGTCGCTA 60 

GTGCCCACCT CCCGGCCCCT CTTTCCTTAG GCGACAGCGG TGGAGTTGCG GGAGAGCGGT 120 

CCAGCGCACG GAGCAACCGG CTAGGGGCTC GGCAGdnTCG CTTATCTTGG GA6CCTGGAC 18 0 

ATTTTGCrCA TCATAAAQAT TAG6TGACCA GTGACATCAG CCATGTCCGT GAGGCCTTTT 240 

GAAAGTCCAC CTCCTTACAG ACCTGATGAA TTCAAACCCA ATCATTATOC ACCAA6CAAT 300 

GACATGTATO GCGGAGAGAT GCATGTCCGG CCGATGCTCT CTCAGCCAGC GTACTCTTTT 360 

TATCCGGAAG ATGAAATTCT TCACTTCTAC AAATGGACGT CGCCCCCAGG GGTGATCCGG 420 

ATCCTGTCTA TGCTCATTAT TGTGATGTGC ATCGCCATAT Thx:'CTGTGT GGCTTCCACA 480 

CTTGCTTGGG ACAGAGGCTA TGGGACAGGG CTCTTTOGAG GAAGCCTAAA CTACCCTTAT 54 0 

AGTCGCTTTQ GCTACGGAGG TGGCTATGGA GGCGGCTATG GAGGCTATGG CTATCGCTAT 600 

GGCGGATATA CAGACCCAAG AGCAGCCAAA^ GGCTTCCTGT TGGCCATCGC '^AGCCTTCKK 660 

TTCATCGCTT CCTTAGTAAT ATTTGTGACC AGTGTTATAA GATClfeGAAT GTCCAGGACA 720 

AGAAGATATT ACTTGATCGT GATCATAGTC AGCGCTATCC T6G0CATCAT GGTCTTTAlT 780 

GCCACGATCG TOTACATAAT GGGAGTGAAC CCGACG6CCC AGGCTTXHTSQ ATCTATGTA^ 840 

GGCTCACAGA TATATATGAT CTGCAACCAG TTTTATACtC CTGQAGGTAC TGGTCTCTAC 900 

GTGGATCAAT ATTTttTATCA CTArTGTGTO GTT^ AGGAGGCTAIt AGCCATTGTC 960 

CTGGGGTTCA TGATTATCGT GGCTTTTGCT TTAAtdATCT TTTTTGCTGi' GAAAACCCGA 1020 

AGAAAGATGG ATCGGTATGA TAAGTCCAAT ATTTTGTGGG ATAA6GAACA CATTTATGAT 1080 

GAACAGCCCC CCAATGTTGA AGAGTGGGTT AAAAATCTGT CTGCAGGCAC ACAGGAeATO ' 114 0 

CCTCCACCCC CATCTGACTA TGCGGAAA5A GTTGACAGTC CAATGGCCTA CTCCTCCAAT * 1200 

GGCAAAGTGA ATGGCAAGCG ATCATACCCA GAjGTCTTTCT ATAAGICAAC ACCTCTGGTG 1260 

CCTGAAGTGG CCCAGGAGAT TCCTCTOACC TTGAGtGTGG ATGAdh'icy^' GCAGCOTCdiG 1320 

TACAGCAGCA ATGOTAACCT AGAGACACCT TCTAAAAGGO CTCCCaSgaA GGGGAAAGC)^- 1306- ' 

GGAAAGGGCA AGAGGAOSGA CCCTGACCAC TATGAAACAG ACTACACGAC AGOTGGGGAG 1440 

TCCTGCGAGG AQCTGG7W3GA GGACTGGGTC AGGGAATATC CACCTATO^ TTCAGATCAA 1500 

CAAAGACAAC TCTACAAGAG AAATTTTGAT GCAGGTCTGC AGGAG^ATA/t GAGCTTACAG 1560 

GCAGAACTAG ACGACGTCAA TAAAGAGCTC TCTCGTCTAG ATAAAGAGCT OGATGACTAC 1620 

AGAGAGGAGA GTGAAGAGTA CATGGCTGCT GCTGATGAAT ATAATAGACT AAAGCAAGTT 1680 

AAGGGATCTG CAGATTATAA AAGTAAGAGG AATTACTGCA AGCAiSTTGAA GACdAAAlTA 1740 

TCGCACATCA AGAGGATGGT GGGAGACTAT GACAGACGGA AACCTIAGAG AGATGCCAGT 1800 

TGCGGGAGAA GGGAGAGGTG CATCTGCCTG CACGATGTCT CTGCAATTCT CTCCAGAGGC 1860 

AAACTGACTT TGGACTCTAA TCTGGGAAGT TAAAACTTTG TGATCATTAC AAAGTTTCCA 1920 

TGGCTTTAAT TCCATCAGTT TCCTATCTCC AGTATTGAAG CATTTTATAA ATGGCTTTTC 1980 
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ATAATTGACT GGGCTGAACA CTCCAATTAA GGATTTTACA GTTTCAACAT TGATTCTTij'r 204 0 

ATTAAGAATT AAAATGTTGC TTGAQGTnT AAATGTCAAG AAAGGTCCTG GTGTGAGCT3 2100 

* TGATGTGTGT GAGCTGTGAT GTGAAGGTTC ACACGCCAGG CAGC6TGTTC GTCCAGGTAG 2160 

ACCGTCTAAT CAATCTTTGC AGCAGCCCTC AGGTGACTGT TATTTAGAAT CAGGTrGTr: 2220 

TTGGTTTTCC AGACAGGGTr TCTCTGTGTA GCCCTGGCTQ ACCTAGAACT TACGCTGTJWS 2280 

10 ACCAGGCTGG CCTTGAACTC ACACAGCTCC TCTGAGTGCT GGTGCAGGAG TTAACGTCCr.:* 2340 

GGACCGC3TAT CATCACTm CCTGCGGTGA CTTCTCtlAAA CTGAAACTC5C TAA^GCAGI.t;* 2400 

TTGGCTAAGT CTGTTTTATG ACTGCAAATG ACAGCATTCC TOCCTTTGTA TTTCAGGGCyi. 2460 

^5 AATACGATAC ATTATATCGG CCATGTTCCC CACCACTGTT TTTCTTATAT TGACTTrT/JL ,2520 

CAAATGAATA GGATTATTTT TGGCTTTACA TTTTTTCCTA ACACTTAAGA TCATATAA;J!; 2580 

TTAApiyWWTA T>GTGAAATTT . AAGAATTGTA , AATATATATT . TACGTTTGAA AGATOATO T 264 0... 

aaatccaggg ttaaagtgct ttttatcttg tatagtttac ATGCimrr TrnrTTTGtf. 2700 

20 

TAi^CCCACTA GACCTTTCCA. TTGTATCAGA GTATQCAATT ACATTTApAA TTATGACnt 276,0 

AATTGTATTT CACAGGAATG CTCAAQITTT CTACATATTT TATAAGGTAT TAAACCTGJ.T 2&20 

GTTCTCTTTC TAAAAAAAA 2839 
(2) INFORMATION FOR SEQ ID NOr 7: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDWESS: single 

(D) TOPOLOGY: linear " ' . . 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - •Synthesis DNA** 



(xi) SEQOENCE DgSCIlIPTION: SEQ ID KOt 7: 
TATGAGACAG ACTACACAAC TGGCGGCGAG TCC 33 
40 (2) INFORMATION FOR SEQ ID NO:. 8; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENOTHr 30 base pairs - . 

(B) TYPE: nucleic acid 

(C) STRANDEDNeSS:. single 

(D) TbPdLOGt: linear ■ ' "v 
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30 



so 



(ii) MOLPOJLE TYPE; other nucJ|..^i,c acid 

(A) DESCRIPTION: /degc = "Synthesis DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATCATAGTCT CCAACCATCT TCTTGATGTG 30 
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Briel Desc;ription of the Drawings: ! 

Ftg. 1 is photographs substituting for drawings shownng the rmrphology of organtsms. Photo. A hi a stained immun* 
ofluoresoent photomicrograph of cuHured human intestinal eprthelia) cell strain T84 with an anti- human ocdudin ral 
6 monodonal antibody. Photo. 6 is a stained immunofluorescent photonrHcrograph of cultured human rttestinal epittielial 
cell strain T84 with a mouse monoclonal antibody against the TJ INng protein ZO-1 . The same siti»; of T84 were pho- 
tographed. 

Claims 

10 . ; V . 

1 . A DNA for encoding a human ocdudin protefn having an amino ackJ sequence as defined in S E O ID NO. 1 . 

2. The DNA for encoding a human dixiudin prote^ according to claim 1 which Is a DNA defined Ir SEQ ID NO. 4. 

rs 3. A DNA for encoding a protein havBig an arhino add sequence in' which one or plural amino adds in an amino add 
sequence defined in SEQ ID NO. 1 are added, deleted or substituted. 

4. A DNA defified in Sequence Na 5 fbr encoding a canine occludine prbteih having an aniind acl d Sequence defined 
in SEQ ID NO. 2. 

20 

5. A DNA defined in SEC ID NO. 6 for encocing a mouse ocdudin protein having an arhlho add f^qiience defined In 
SEQ ID Na 3. ^ 

6. A vector which comprises the DNA described in any one of dainis 1 to 5. 

7. A transfonnant which hokte the vector described in daim 6. 

8. A human occtudin protein which has an amino acid sequence defined in SEQ ID NO. 1 . 

so 9. A protein fiaving an amino add sequence In which one or plural amino adds tn an amino add sequence defined m 
SEQ ID NO. 1 are added, deleted or substituted. 

10. A partial peptide of a human ocdudin protisin which has an amino add sequence deffhed In SEKitD NO. 1. 

35 11. A canine ocdudin protein which has an amino add sequence defined In SEQ ID NO, 2. 

1 2. A mouse ocdudin protein which has an amino add sequence defined in ^EQ Id NO. 3. 

13. A method of manufacturing the protdn desatbed in any one of plalrns 8 to 11 which conprlses the steps of cuUi- 
40 vating the transfbrnrant described in Claim 7 and coiecting an e^qxessed'p^^ 

14. A DNA probe which comprises all or a part of a base sequence defined In SEQ ID NO. ^,$qrS 

1 5. A DNA primer which corrprises a part of a base sequence as defined In SEQ ID t<ilQ* 4* 5 6r 6. 

46 

16. A polyclonal antibody or a monodonal afftibody wtdqh specifically bihd^ to the pftrfein despitiied In any one of 
daims8»11 and 12. - ^ . ■ 

17. An analysis method of the DNA gene descrit>ed tn dalm 2, 4 or 5 in a biological specimen, where n the DNA primer 
60 described in daim T 5 is used. 

18. An analysis method of the DNA gene described in daim 2, 4 or 5 in a biological spedmen, whei c in the DNA probe 
descrtoed in daim 1 4 is used. 

55 1 9. A screening metixxJ of a drug affecting the expression of oodudia which comprises the steps ol i^llowing ocdudin- 
expressing cells and an anatyte to coexist and then determining an expression quantity of an oc>;iudin gene of the 
cells by the method of daim 17 or 18. 
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21 . An assay reagent for ocdudin in a biolOQical specimen, which comprises the antibody desa ted in daim 



16. 
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